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(54) Title: CRYSTAL 

OS 

(57) Abstract: A crystal comprising an androgen receptor ligand binding domain (AR-LBD) is provided. The crystal structures of 
the human Androgen Receptor Ligand Binding Domain (hAR-LBD) in comparison with the human Progesterone Receptor Ligand 
' — ^ Binding Domains (hPR-(hPR-LBD) complexed with the same ligand metribolone (R1881) are also provided. The three-dimensional 
structures of the hAR LBD as well as the hPR LBD show the typical nuclear receptor fold. The change of two residues in the ligand 
binding pocket (LBP) between hPR and hAR seems to be the most likely source for the specificity of the R1881 ligand binding to 
hAR LBD. The structural implications of the 14 known mutations in the LBP of the hAR LBD associated with either prostate cancer 
(PC) or the partial androgen receptor insensitivity syndrome (PAIS) or complete androgen receptor insensitivity syndrome (CAIS) 
are analysed. The effects of most of these mutants may be explained on the basis of the crystal structure. 
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CRYSTAL 

5 FIELD OF THE INVENTION 

The present invention relates to a crystal structure. 

In particular, the present invention relates to a crystal structure for a ligand binding 
10 domain (LBD). 

In particular, the present invention relates to a crystal structure for a ligand 
binding domain (LBD) optionally having a ligand which is associated therewith. 

15 In particular, the present invention relates to a crystal structure for a LBD of a 
receptor. 

More in particular, the present invention relates to a crystal structure for a LBD 
of an androgen receptor (AR-LBD) and also to a crystal structure for an AR- 
20 LBD-ligand complex. 

The structure may be used to determine androgen receptor homologues and 
information about secondary and tertiary structures of polypeptides which are as 
yet structurally uncharacterised. The structure may also be used to identify 
25 ligands which are capable of binding to the androgen receptor. Such ligands 
may be capable of acting as modulators of androgen receptor activity. 

The crystal structure of AR-LBD enables a model to be produced for androgen 
receptor activity. Thus, the present invention provides a model which can be used 
30 to understand the structural implications of the binding mechanism. 
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BACKGROUND TO THE INVENTION 

The androgen receptor (AR) is a member of the superfamily of nuclear receptors 
which includes, amongst others, the steroid receptors as well as the vitamin D, 
5 thyroid, retinoic acid receptors and the so-called orphan receptors. In addition, 
the AR is a member of a group of four closely related steroid receptors including 
the progesterone receptors (PR), the mineralocorticoid receptor and the 
glucocorticoid receptor all of which recognise the same hormone response 
element. In general, steroid receptors are comprised of five to six domains which 

10 act as ligand-activated transcription factors that control the expression of specific 
genes. The ligand binding region is located in the C terminal domain and is 
called the ligand binding domain (LBD). Binding of a ligand (such as a steroid 
hormone) to the LBD induces changes in receptor conformation that control 
transcriptional activation and repression and also regulate homo- or 

15 heterodimerisation. In the absence of ligand, these receptors repress basal gene 
expression, probably through the expression of co-repressor proteins. 

The androgen hormones and their receptors play an important role in male 
physiology and pathology. The androgen receptor binds the male sex steroids, 

20 dihydrotestosterone (DHT) and testosterone [Teutsch, 1994], and regulates genes 
for male differentiation and development. Consequently, constitutional 
mutations in the androgen receptor gene may lead to several disease states. Some 
examples of these diesease states include prostate cancer (PC) and the androgen 
insensitivity syndrome (AIS) which are capable of impairing androgen-dependent 

25 male sexual differentiation to various degrees. In addition, complete androgen 
insensitivity syndrome (CAIS) leads to an unequivocally external female 
phenotype. In contrast, partial or incomplete androgen insensitivity syndrome 
(PAIS) comprises a wide spectrum of clinical phenotypes while mild androgen 
insensitivity syndrome (MAIS), is connected to forms of undervirilisation [Bellis, 

30 1992 ]. About 50% of the mutated residues reported in the human androgen 
receptor ligand binding domain (hAR LBD) to date are found to be involved in 
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prostate cancer (PC) and in AIS [Gottlieb, 1998]. These mutations have been well 
documented in the Androgen Receptor Gene Mutations Database of the Lady 
Davis Institute for Medical Research [Gottlieb, 1998]. 

5 To date, there are a total of 20 known amino acid residues in the AR LBD which are 
involved in ligand interaction. Of these 20 amino acid residues, to date, mutations 
have been reported in 14 of the 20 amino acid residues. These mutations are 
largely in the ligand binding pocket (LBP) which is part of the AR-LBD. By way 
of example, the three mutations in the LBP of the hAR , which have been described 

10 for CAIS, these being N705S [Bellis, 1992; Pinsky, 1992], L707R [Lumbroso, 
1996] and M749V [Bellis, 1992; Jakubicza, 1992]} are recognised as substitutions 
that considerably change the size and charge properties of the respective amino acid 
side chains. However, while it is known that these amino acid substitutions result in 
a considerably change in size of the respective amino acid side chains, it is not 

15 known how this change in size alters the AR-LBD such that the local structure and 
interactions with the ligand are disturbed. Moreover, because both the structural 
implications and the effects of these known mutation have not been determined, no 
ligand binding data are available for many of the published mutations in the AR- 
LBD. 

20 

In order to develop an understanding of the structural implications of mutations 
resulting in amino acid substitutions in the AR-LBD, attempts have been made by 
workers to determine the primary, secondary and tertiary structures of the AR- 
LBD. In this regard, the LBDs of the different nuclear receptor families have 

25 been analysed and shown to share a similar fold in spite of their low (about 20%) 
sequence homology. In this respect, the receptor fold has been shown to 
comprise about 12 helices and several small (3-sheet arranged in a so-called "oc- 
helical sandwich". Up until now, this kind of fold has only been observed for the 
LBDs of nuclear receptors. However, it has also been shown that, depending on 

30 the nature of the bound ligand, which may be an agonist or an antagonist, the 
carboxyterminal helix H12 may be found in either one of two orientations. In the 
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agonist-bound conformation, helix HI 2 serves as a 'lid 1 to close the ligand- 
binding pocket (LBP), which contains the LBD, whereas in the antagonist-bound 
conformation, helix HI 2 is positioned in a different orientation thus opening the 
entrance to the LBP. 

5 

Despite the availability of information regarding the role of the helix H12 region in 
ligand binding, there is very little experimental information available about the 
structure or the role of the other helical regions (such as helices HI to Hll) with 
respect to ligand binding. By way of example, there has been a suggestion that 

10 helices H4 and H5 may be regions involved in ligand binding. However, no 
experimental information is available with respect to these helices in the AR- 
LBD. In addition, while it is thought that while about 50% of the mutated residues 
reported in the hAR LBD are found to be involved in prostate cancer (PC) and in 
AIS [Gottlieb, 1998], it is not known experimentally whether the mutations are 

15 predominantly found in the interior of the receptor protein or at the surface of the 
receptor protein. 

Structurally, it is known that the nuclear receptors, such as the androgen receptor, 
can be organised into functional modules comprising an N-terminal transcriptional 

20 activation domain, a central DNA binding domain (DBD) and a C-terminal ligand 
binding domain (LBD). During the past few years, X-ray structures have been 
published for two of the domains, the DNA-binding domain as well as for a number 
of ligand-binding domains (LBD) including LBD-ligand complexes of receptors 
such as the estrogen receptor a and p, the progesterone receptor (PR), the vitamin D 

25 receptor, the retinoic acid receptors (X: RXR, acid: RAR), the thyroid hormone 
receptor and the peroxisome proliferator-activated receptors [Moras, 1998; 
Brzozowski, 1997; Tannenbaum, 1998; Shiau, 1998; Bourguet, 1995; Renaud, 
1995; Wagner, 1995; Ribeiro, 1998; Williams, 1998; Nolte, 1998; Uppenberg, 
1998 ; Klaholz, 1998; Rochel, 1999]. 



30 



WO 01/66599 



PCT/IB01/00475 
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To date, no X-ray structures have been published for the AR-LBD either alone or 
in combination with a ligand. Although a model structure of the AR-LBD has 
been developed by Yong et al (1998), this model is based on the crystal structure 
of the RARoc LBD [Bourguet, 1995] and not on either the AR-LBD or a more 
5 closely related receptor such as a PR-LBD. In addition, no experimentally 
determined three-dimensional (3D) structure is available for a complete androgen 
receptor either alone or in combination with a ligand. Furthermore, although the 
crystal structure of the progesterone receptor (PR) LBD in complex with 
progesterone was published in 1998 by Williams and Sigler, no comparative 
10 experimental analyses have been carried out between closely related steroid 
receptors such as an androgen receptor-LBD and progesterone receptor, either 
alone or complexed with ligands in order to identify ligand specificities and/or 
ligand specific residues. 

15 SUMMARY OF THE INVENTION 

In a broad aspect the present invention relates to cystal structures of receptor 
ligand binding domains including the uses thereof. 

20 SUMMARY ASPECTS 

According a first aspect of the invention there is provided a crystal structure 
comprising an AR-LBD. 

25 In a preferred embodiment the crystal structure is a crystal structure for an AR- 
LBD. 

The structure of a crystal AR-LBD has been solved and is set forth in Table 4. 

30 In a second aspect the present invention provides a crystal structure comprising 
an AR-LBD-ligand complex. 
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In a third aspect the present invention provides a crystal structure comprising an 
AR-LBP. 

5 According to a fourth aspect of the invention, there is provided a model of at least 
part of an AR-LBD made using or comprising or depicting a crystal structure 
according to any one of the first, second and third aspects of the invention. The 
crystal structure of the first, second and third aspect of the invention and the 
model of the fourth aspect of the invention may be provided in the form of a 
1 0 computer readable medium. 

The crystals and models of earlier aspects of the invention may provide 
information about the atomic contacts involved in the interaction between the 
receptor and a known ligand, which can be used to screen for unknown ligands. 

15 

According to a fifth aspect of the invention, there is provided a method of 
screening for a ligand capable of binding an androgen receptor binding domain, 
comprising the use of a crystal structure according to any one of the first, second 
or third aspects of the invention or a model according to the fourth aspect of the 
20 invention. For example, the method may comprise the step of contacting the AR- 
LBD with a test compound, and determining if said test compound binds to said 
ligand binding domain. The method may be an in vitro method and/or an in silico 
method and/or an in vivo method. 

25 In a sixth aspect, the present invention provides a ligand identified by a screening 
method of the fifth aspect of the invention. Preferably the ligand is capable of 
modulating the activity of an AR-LBD. As mentioned above, ligands which are 
capable of modulating the activity of AR-LBDs have considerable therapeutic 
and prophylactic potential. 

30 
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In a seventh aspect, the present invention provides the use of a ligand according 
to the sixth aspect of the invention, in the manufacture of a medicament to treat 
and/or prevent a disease in a mammalian patient. There is also provided a 
pharmaceutical composition comprising such a ligand and a method of treating 
5 and/or preventing a disease comprising administering the step of administering 
such a ligand according or pharmaceutical composition to a mammalian patient. 

The crystal structures and models described above also provide information about 
the secondary and tertiary structure of AR-LBDs. This can be used to gleen ) 

10 structural information about other, previously uncharacterised polypeptides. 
Thus, according to an eighth aspect of the invention there is provided a method of 
determining the secondary and/or tertiary structures of polypeptides with 
unknown (or only partially known) structure comprising the step of using such a 
crystal or model. The polypeptide under investigation is preferably structurally 

15 or functionally related to the androgen receptor ligand binding domain. For 
example, the polypeptide may show a degree of homology over some or all parts 
of the primary amino acid sequence. Alternatively, the polypeptide may perform 
an analogous function or be suspected to show a similar binding mechanism to 
the AR-LBD. 

20 

The present invention demonstrates that the hAR-LBD crystal structure can be 
used to analyse and explain the structural implications of 14 known mutations in 
the LBP of the hAR LBD which are associated with either prostate cancer (PC), 
the partial androgen receptor insensitivity syndrome (PAIS), mild androgen 
25 receptor insensitivity syndrome (MAIS) or complete androgen receptor 
insensitivity syndrome (CAIS). 

The present invention also demonstrates that a crystal structure of an AR-LBD may 
be used to identify ligands (such as agonists/antagonists) with binding specificity 
30 for the AR LBD. In this way, compounds may be selected, improved or modified 
to improve this ligand binding interaction. 
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The present invention also provides the crystal structure of the human hAR LBD 
in complex with the ligand metribolone (R1881) and the crystal structure of the 
human hPR LBD in complex with the ligand metribolone (R1881). The 
5 provision, for the first time, of these two experimentally determined three 
dimensional (3D) crystal stuctures has facilitated a comparison to be drawn 
between the crystal structure of both receptors in complex with the same ligand. 
Up until now, it has been known from studies on model receptros that the AR- 
LBD and the PR-LBD have a number of similarities in that: 

10 

(i) they belong to the same steroid receptor subfamily; 

(ii) they share about 54% LBD sequence identity (Figure 1); and 

15 (hi) there are a number of different ligands with similar binding affinities for 
both receptors [Teutsch, 1994]. 

The present invention highlights an additional similarity between the hAR-LBD and 
hPR-LBD ligand complexes in that the three-dimensional structures of the hAR 
20 LBD as well as the hPR LBD demonstrate the typical nuclear receptor fold. 

The present invention also demonstrates some hitherto unknown, but important, 
differences between the two receptors. These include: 

25 (i) the identification of a two amino acid residue change in the ligand binding 
pocket (LBP) of the AR-LBD which is the most likely site for the specific binding 
of the Rl 881 ligand to the hAR-LBD. The AR-LBD amino acid residues are Leu 
880 and Thr 877. The corresponding PR-LBD amino acid residues are Thr 894 
and Cys 891. In addition, there are three other amino acid changes which maybe 

30 involved in binding of ligands other than R1881. The AR amino acid residues are 
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Gin 783, Met 749 and Phe 876. The PR amino acid residues are Leu 797, Leu 763 
andTyr890. 

(ii) the demonstration that the hPR LBD - R1881 complex crystallises as a 
5 dimer in the asymmetric unit whereas the hAR LBD-R1 881 complex crystallises as 

a monomeric unit. 

(iii) the demonstration that the two independent molecules in the crystal structure 
of hPR LBD - R1881 exhibit different modes of ligand binding. One orientation of 

10 R1881 in one monomer resembles that of R1881 in the hAR LBD complex, while 
in the second monomer, R1881 is orientated similar to progesterone in the hPR 
LBD - progesterone complex. 

The present invention demonstrates the surprising and unexpected findings that: 

15 

(i) the helix H6 in the AR-LBD is an a-helix. In striking contrast, no a-helix 
was found either in the model hAR-LBD in this area or in the hPR-LBD- 
progesterone complex (Molecule A) (see Figure 4) whereas in the hPR-LBD- 
progesterone complex (Molecule B), an a-helix is observed . 

20 

(ii) helices H4 and H5 and helices H10 and Hll are preferably contiguous 
helices. That is, these helices H4 and H5 and H10 and HI 1 are connected to each 
other to form 2 continuous helices rather than 4 separate helices. Accordingly, the 
a-helical sandwich structure for the AR-LBD comprises preferably 9 a-helical 

25 regions instead preferably 11 a-helices. This observation was not seen in the 
liganded PR-LBD (Williams, 1998) which comprises 10 a-helices and where 
only helices H10 and HI 1 are contiguous sequences. 

(iii) in the hAR-LBD-Rl 881 complex, the helix HI 2 is split into two shorter 
30 helical segments with 9 and 5 amino acid residues respectively. This observation 
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was not seen in the hPR LBD-R1881 complex structure although a bending of 
helix H12 was also seen. As it is known that helix H12 may influence the 
binding of antagonists and agonists, this finding may have important implications 
for ligand binding. 

5 

(iv) the demonstration that the two independent molecules in the crystal structure 
of hPR LBD - R1881 exhibit different modes of ligand binding. One orientation of 
R1881 in one monomer resembles that of R1881 in the hAR LBD complex, in the 
second monomer R1881 is orientated similar to progesterone in the hPR LBD - 
10 progesterone complex. 

The present invention is advantageous as the determination of the 3D structure of 
the AR-LBD allows the AR-LBD to be mapped. 

15 The use of the crystals stucture in conjunction with this map enables a better 
understanding of ligand specificities for the AR-LBD. 

In particular, the crystal structure of the present invention now makes it possible 
to see: 

20 

(i) not only how a ligand binds to the AR-LBD but also 

(ii) the structural reasons why a ligand binds to an AR-LBD. 

25 Using the crystal structure, these effects can not only be understood but can also 
be predicted. This improved understanding of the AR-LBD facilitates the 
identification and modification of ligands which are capable of specifically 
and/or preferentially interacting with the AR-LBD. 

30 The present invention is also advantageous as it facilitates: 
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(i) the identification and characterization of the key residues within the AR- 
LBD and a comparison with those associated with the PR-LBD. In this regard, the 
present invention demonstrates an important new finding in relation to the PR-LBD- 
progesterone complex. In this respect, Asn 705 in the AR-LBD and Asn 719 in the 

5 PR-LBD have been shown to be capable of acting as hydrogen bond partners for 
ligands, which have, for example, a hydroxyl group attached to position 17 or to a 
substituent attached to position 17 on a steroidal ligand . 

(ii) the identification and characterization of the interaction of ligands with 
10 the AR-LBD sites. 

(iii) the identification of ligands with enhanced properties capable of 
interacting with one or more residues of the LBD. These enhanced properties 
include but not limited to: (s) higher affinity, (b) improved selectivity for the AR, 

15 and/or (c) a designated degree of efficacy (agonism vs. partial agonism vs. 
antagonism vs partial antagonism). 

(iv) the design of one or more ligands which may specifically bind to an AR- 
LBD but not to a PR-LBD (ie a selective ligand). 

20 

(v) the determination of the structural effects associated with a mutation. (In 
this respect, although, many of the phenotypic traits associated with the 
characterised mutations in the androgen receptor gene are known, the structural 
implications of such mutations have not been determined). 

25 

(vi) the identification of ligands capable of overcoming the mutation/structural 
disturbance in the AR-LBD and/or LBP comprising the AR-LBD . 

(vii) the determination of ligand binding data (affinity constants etc) which 
30 have not been available for many of the published mutant receptors. 
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(viii) the implementation of an iterative drug design and/or for "reverse- 
engineering" or " de novo design" of compounds and/or "structure-based drug 
design". 

5 (ix) a detailed understanding of the structure of the LBDs receptors, such as 
the AR and PR which enables in vitro ligand binding data to be explained and 
understood. 

(x) a reduction in the length of time required to discover compounds that 
10 target the AR-LBD. 

Other aspects of the present invention are presented in the accompanying claims and 
in the following description and drawings. These aspects are presented under 
separate section headings. However, it is to be understood that the teachings under 
15 each section are not necessarily limited to that particular section heading. 

DETAILED ASPECTS OF THE INVENTION 

Unless otherwise indicated, all terms used herein have the same meaning as they 
20 would to one skilled in the art of the present invention. Practitioners are 
particularly directed to Current Protocols in Molecular Biology (Ansubel) for 
definitions and terms of the art. 

According to one aspect of the present invention, there is provided a crystal 
25 structure comprising an androgen receptor ligand binding domain (AR-LBD). 

Preferably the AR-LBD is a human AR-LBD. 

In a preferred aspect of the present invention, there is provided a crystal structure 
30 comprising a ligand binding domain (LBD) wherein the LBD is arranged in an oc- 
helical sandwich comprising preferably the a-helices HI, H3, H4, H5, H6, H7, 
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H8, H9, H10, Hll and H12; preferably two 3i 0 helices; and preferably four short 
P strands (SI, S2 5 S3 and S4) associated in two anti-parallel p-sheets; wherein the 
helices H4, H5, H10 and Hll are preferably contiguous helices; and wherein 
either helix H6 is preferably an a-helix and/or helix HI 2 comprises preferably 
5 two helical segments of preferably 9 amino acid residues and preferably 5 amino 
acid residues. 

CRYSTAL 

10 As used herein, the term "crystal" means a structure (such as a three dimensional 
(3D) solid aggregate) in which the plane faces intersect at definite angles and in 
which there is a regular structure (such as internal structure) of the constituent 
chemical species. Thus, the term "crystal" can include any one of: a solid 
physical crystal form such as an experimentally prepared crystal, a 3D model 

15 based on the crystal structure, a representation thereof such as a schematic 
representation thereof or a diagrammatic representation thereof, a data set thereof 
for a computer. 

CRYSTAL PREPARATION 

20 

The crystals of the present invention may be prepared by expressing a nucleotide 
sequence encoding the AR-LRD and PR-LBD by use of a suitable host cell and 
then crystallising the purified receptor protein. 

25 The invention also features a method for creating crystalline AR-LBD structures 
described herein. The method may utilize a polypeptide comprising an AR-LBD 
described herein to form a crystal. A polypeptide used in the method may be 
chemically synthesized in whole or in part using techniques that are well-known 
in the art. Alternatively, methods are well known to the skilled artisan to 

30 construct expression vectors containing the native or mutated AR-LBD coding 
sequence and appropriate transcriptional/translational control signals. These 
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methods include in vitro recombinant DNA techniques, synthetic techniques, and 
in vivo recombination/genetic recombination. See for example the techniques 
described in Sambrook et al. (Molecular Cloning: A Laboratory Manual, 2nd 
Edition, Cold Spring Harbor Laboratory press (1989)), and other laboratory 
5 textbooks. (See also Sarker et al, Glycoconjugate J. 7:380, 1990; Sarker et al, 
Proc. Natl. Acad, Sci. USA 88:234-238, 1991, Sarker et al, Glycoconjugate J. 11: 
204-209, 1994; Hull et al, Biochem Biophys Res Commun 176:608, 1991 and 
Pownall et al, Genomics 12:699-704, 1992). 

10 Crystals are grown from an aqueous solution containing the purified AR-LBD 
polypeptide by a variety of conventional processes. These processes include 
batch, liquid, bridge, dialysis, vapor diffusion, and hanging drop methods. (See 
for example, McPherson, 1982 John Wiley, New York; McPherson, 1990, Eur. J. 
Biochem. 189: 1-23; Webber. 1991, Adv. Protein Chem. 41:1-36). Generally, the 

15 native crystals of the invention are grown by adding precipitants to the 
concentrated solution of the AR-LBD polypeptide. The precipitants are added at 
a concentration just below that necessary to precipitate the protein. Water is 
removed by controlled evaporation to produce precipitating conditions, which are 
maintained until crystal growth ceases. 

20 

Derivative crystals of the invention can be obtained by soaking native crystals in 
a solution containing salts of heavy metal atoms. A complex of the invention can 
be obtained by soaking a native crystal in a solution containing a compound that 
binds the AR-LBD, or they can be obtained by co-crystallizing the AR-LBD 
25 polypeptide in the presence of one or more compounds that bind to the AR-LBD. 

Once the crystal is grown it can be placed in a glass capillary tube and mounted 
onto a holding device connected to an X-ray generator and an X-ray detection 
device. Collection of X-ray diffraction patterns are well documented by those 
30 skilled in the art (See for example, Ducruix and Geige, 1992, IRL Press, Oxford, 
England). A beam of X-rays enter the crystal and diffract from the crystal. An X- 
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ray detection device can be utilized to record the diffraction patterns emanating 
from the crystal. Suitable devices include the Marr 345 imaging plate detector 
system with an RU200 rotating anode generator. 

5 Methods for obtaining the three dimensional structure of the crystalline form of a 
molecule or complex are described herein and known to those skilled in the art 
(see Ducruix and Geige). Generally, the x-ray crystal structure is given by the 
diffraction patterns. Each diffraction pattern reflection is characterized as a vector 
and the data collected at this stage determines the amplitude of each vector. The 

10 phases of the vectors may be determined by the isomorphous replacement method 
where heavy atoms soaked into the crystal are used as reference points in the X- 
ray analysis (see for example, Otwinowski, 1991, Daresbury, United Kingdom, 
80-86). The phases of the vectors may also be determined by molecular 
replacement (see for example, Naraza, 1994, Proteins 11:281-296). The 

15 amplitudes and phases of vectors from the crystalline form of an AR-LBD 
determined in accordance with these methods can be used to analyze other 
crystalline AR-LBDs. 

The unit cell dimensions and symmetry, and vector amplitude and phase 
20 information can be used in a Fourier transform function to calculate the electron 
density in the unit cell i.e. to generate an experimental electron density map. This 
may be accomplished using the PHASES package (Furey, 1990). Amino acid 
sequence structures are fit to the experimental electron density map (i.e. model 
building) using computer programs (e.g. Jones, TA. et al, Acta Crystallogr A47, 
25 100-119, 1991). This structure can also be used to calculate a theoretical electron 
density map. The theoretical and experimental electron density maps can be 
compared and the agreement between the maps can be described by a parameter 
referred to as R-factor. A high degree of overlap in the maps is represented by a 
low value R-factor. The R-factor can be minimized by using computer programs 
30 that refine the structure to achieve agreement between the theoretical and 
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observed electron density map. For example, the XPLOR program, developed by 
Brunger (1992, Nature 355:472-475) can be used for model refinement 

A three dimensional structure of the molecule or complex may be described by 
5 atoms that fit the theoretical electron density characterized by a minimum R 
value. Files can be created for the structure that defines each atom by coordinates 
in three dimensions. 

AR AND PR CONSTRUCTS 

10 

The proteins comprising the AR-LBD and PR-LBD may be produced by a host 
recombinant cell may be secreted or may be contained intracellularly depending 
on the nucleotide sequence and/or the vector used. As will be understood by 
those of skill in the art, expression vectors containing the AR and PR encoding 

15 nucleotide sequences can be designed with signal sequences which direct 
secretion of the AR and PR coding sequences through a particular prokaryotic or 
eukaryotic cell membrane. Other recombinant constructions may join the AR or 
PR encoding sequence to nucleotide sequence encoding a polypeptide domain 
which will facilitate purification of soluble proteins (Kroll DJet al (1993) DNA 

20 Cell Biol 12:441-53). Such purification facilitating domains include, but are not 
limited to, metal chelating peptides such as histidine-tryptophan modules that 
allow purification on immobilized metals (Porath J (1992) Protein Expr Purif 3 - 
.26328 1), protein A domains that allow purification on immobilized 
immunoglobulin, and the domain utilized in the FLAGS extension/affinity 

25 purification system (Immunex Corp, Seattle, WA). The inclusion of a cleavable 
linker sequence such as Factor XA or enterokinase (Invitrogen, San Diego, CA) 
between the purification domain and the AR and PR is useful to facilitate 
purification. 
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HOST CELLS 

A wide variety of host cells can be employed for expression of the nucleotide 
sequences encoding the AR and PR proteins of the present invention. These cells 
5 may be both prokaryotic and eukaryotic host cells. Suitable host cells include 
bacteria such as E. colt, yeast, filamentous fungi, insect cells, mammalian cells, 
typically immortalized, e.g., mouse, CHO, human and monkey cell lines and 
derivatives thereof. Preferred host cells are able to process the expression 
products to produce an appropriate mature polypeptide. Processing includes but is 
10 not limited to glycosylation, ubiquitination, disulfide bond formation and general 
post-translational modification. 

NUCLEOTIDE SEQUENCES 

15 As used herein, the term "nucleotide sequence" refers to nucleotide sequences, 
oligonucleotide sequences, polynucleotide sequences and variants, homologues, 
fragments and derivatives thereof (such as portions thereof) which comprise the 
nucleotide sequences encoding the AR-LBD and PR-LBD. The nucleotide 
sequence may be DNA or RNA of genomic or synthetic or recombinant origin 

20 which may be double-stranded or single-stranded whether representing the sense 
or antisense strand or combinations thereof. Preferably, the term nucleotide 
sequence is prepared by use of recombinant DNA techniques (e.g. recombinant 
DNA). The nucleotide sequence may include within them synthetic or modified 
nucleotides. A number of different types of modification to oligonucleotides are 

25 known in the art. These include methylphosphonate and phosphorothioate 
backbones, addition of acridine or polylysine chains at the 3 f and/or 5' ends of the 
molecule. For the purposes of the present invention, it is to be understood that 
the nucleotide sequences described herein may be modified by any method 
available in the art. Such modifications may be carried out in order to enhance 

30 the in vitro activity or life span of nucleotide sequences of the invention. 
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Preferably, the term "nucleotide sequence" means cDNA. 
FUSION PROTEINS 

5 The AR and PR proteins comprising the AR-LBD and PR-LBD of the present 
invention may also be produced as fusion proteins, for example to aid in 
extraction and purification. Examples of fusion protein partners include 
glutathione-S-transferase (GST), 6xHis, GAL4 (DNA binding and/or 
transcriptional activation domains) and p-galactosidase. It may also be 
10 convenient to include a proteolytic cleavage site between the fusion protein 
partner and the protein sequence of interest to allow removal of fusion protein 
sequences. 

AMINO ACID SEQUENCES 

15 

Preferably the fusion protein will not hinder the ligand binding activity of the 
AR-LBD and PR-LBD comprising the amino acid sequences (SEQ ID No 1 and 
SEQ ID No 3 respectively) of the present invention. 

20 Preferably AR-LBD comprises at least SEQ ID No 1, or a homologue or mutant 
thereof. 

Preferably the PR-LBD comprises at least SEQ ID No 3, or a homologue or 
mutant thereof. 

25 

CRYSTALLISATION 

After cleavage of the fusion protein, the AR-LBD and PR-LBD may be separated 
from the cleavage products by chromatographic methods. Concentration may be 
30 performed with the aid of a filtration system and the protein concentrate may be 
immediatedly used for crystallisation purposes. The protein concentrate may be 
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crystallised using, for example, the vapour diffusion method at a temperature of 
from about 1°C to about 30°C, preferably from about 4°C to about 20°C. The 
crystallisation temperature is dependent on the additives present in the protein 
solution. 

5 

Typically, the crystals comprising the AR-LBD are purified to homogeneity for 
crystallisation. Purity of the AR-LBDs may be measured by typical techniques 
such as with SDS-PAGE, mass spectrometry and hydrophobic HPLC. 

10 Preferably crystal comprises the AR-LBD or a homologue or mutant thereof. 

Preferably the crystal comprises the PR-LBD or a homologue or mutant thereof. 

Preferably the crystal is usable in X-ray crystallography techniques. Preferably the 
15 crystals used can withstand exposure to X-ray beams used to produce a 
diffraction pattern data necessary to solve the X-ray crystallographic structure. 

Preferably the crystal has a resolution determined by X-ray crystallography of from 
about 1.5 A to about 3. 5 A, preferably about 1.5 A. 

20 

Preferably the crystal has a resolution determined by X-ray crystallography of from 
about 1 .5 A to about 3.0 A. 

Preferably the crystal comprising the AR-LBD has the secondary structure 
25 presented as SEQ ID No 2, or a homologue or mutant thereof. 

The crystal may be formed from an aqueous solution comprising a purified 
polypeptide comprising an AR-LBD. 



30 



The term "purified" in reference to a polypeptide, does not require absolute purity 
such as a homogenous preparation rather it represents an indication that the 
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polypeptide is relatively purer than in the natural environment. Generally, a 
purified polypeptide is substantially free of other proteins, lipids, carbohydrates, 
or other materials with which it is naturally associated, preferably at a 
functionally significant level for example at least 97.5% pure, more preferably at 
5 least 99% pure, most preferably at least 99.5% pure. A skilled artisan can purify a 
polypeptide comprising an AR-LBD using standard techniques for protein 
purification. A substantially pure polypeptide comprising an AR-LBD will yield 
a single major band on a non-reducing polyacrylamide gel. The purity of the AR- 
LBD can also be determined by ammo-terminal amino acid sequence analysis. 

10 

The term "associate", "association" or "associating" refers to a condition of 
proximity between a moiety (i.e. chemical entity or compound or portions or 
fragments thereof), and an AR-LBD, or parts or fragments thereof (e.g. binding 
sites or domains). The association may be non-covalent i.e. where the 
15 juxtaposition is energetically favored by for example, hydrogen-bonding, van der 
Waals, or electrostatic or hydrophobic interactions, or it may be covalent. 



ANDROGEN 



20 As used herein, the term "androgen refers to any substance, natural or synthetic, 
that is able to stimulate the development of male sexual characteristics. Naturally 
occuring androgens are represented by the C i9 -steroid hormones. They are 
produced especially by the testis (such as testosterone) and also by the adrenal 
cortex, ovary and the placenta. As used herein, the term "androgen" relates to the 

25 male sex steroids, dihydrotestosterone (DHT) and testosterone [Teutsch, 1994] 
which bind to the AR-LBD and which regulate the genes for male differentiation 
and development. 
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ANDROGEN RECEPTOR 

As used herein, the term "androgen receptor (AR)" means any of the androgen- 
binding nuclear proteins that mediate the effects of androgens by regulating gene 
5 expression. The androgen receptor proteins are discrete zinc-finger proteins 
which bind discrete DNA sequences, located upstream of transcriptional start 
sites, when an AR-ligand complex is formed. The androgen receptor (AR) 
binding domain, also known as the androgen receptor ligand binding domain 
(AR-LBD), or the hormone binding domain (HBD), is in the C -terminal region. 

10 In humans, a number of variants are known that are associated with 
abnormalities, including prostate cancer (PC), testicular feminisation syndrome, 
complete androgen insensitivity syndrome (CAIS) and/or partial androgen 
insensitivity syndrome (PAIS) and/or mild androgen insensitivity syndrome 
(MAIS) which may lead to external genitalia varying between female and nearly 

15 normal male. 

As used herein, the term "androgen receptor" means the wild type androgen 
receptor or a mutant androgen receptor. 

20 WILD TYPE 

The term "wild type" refers to the phenotype that is characteristic of most of the 
members of a species occuring naturally and which contrasts with the phenotype 
of a mutant species. As used herein, the term "wild type androgen receptor" 

25 refers to the an androgen receptor comprising the amino acid sequence presented 
as SEQ ID No 1. In particular, the term "wild type androgen receptor" refers to 
the androgen receptor comprising a ligand binding pocket (LBP) wherein the 
LBP is defined by the structural co-ordinates of the AR-LBD amino acid residues 
L701; L704; N705; L707; Q711; M742; L744; M745; M749; R752; F764; Q783; 

30 M787; F876; T877; L880; F891 ; M895 or a homologue thereof. 
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MUTANT 

As used herein, the term "mutant" refers to any organism that has undergone 
mutation or that carries a mutant gene that is expressed in the phenotype of that 
5 organism. A mutation may arise due to a substitution of one nucleotide for 
another or from a deletion of a nucleotide or an insertion of a nucleotide relative 
to a referenced wild type sequence. These single nucleotide variations are 
sometimes referred to as single nucleotide polymorphisms (SNPs). Some SNPs 
may occur in protein-coding sequences, in which case, one of the polymorphic 
10 forms may give rise to the expression of a defective or other variant protein and, 
potentially, a genetic disease. Other SNPs may occur in noncoding regions. 
Some of these polymorphisms may also result in defective protein expression 
(e.g., as a result of defective splicing). Other SNPs may have no phenotypic 
effects. 

15 

As used herein, the term "mutant" refers to an androgen receptor comprising any 
one or more changes in the sequence (and/or the structural co-ordinates) and of 
the amino acid residues in the AR-LBD which interact with bound ligand 
wherein the amino acid changes in the AR-LBD may be selected from any one or 

20 more of the group of LBD amino acid residues substitutions consisting of:L701H; 
M749I; T877A; T877S; L880Q; F891L;N705S; L707R; M749V; G708A; 
G708V; M742V; M742I; M745T; V746M; R752Q; F764S; M787V. In this 
regard, the sequence and amino acid residues (such as L701H) are described 
using the one letter format for the amino acid residue (such as L), followed by the 

25 amino acid designations nunber which refers to the amino acid residue in the wild 
type sequence directly above the last digit, followed by the mutant amino acid 
residue (here a substituted amino acid residue) which is also described using the 
one letter format for the amino acid residue (in this case H). 
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For some embodiments the androgen receptor may comprise two or more 
mutated amino acid residues. An example of such an embodiment is L701H and 
T877A. 

5 The term "mutant" is not limited to the above mutations which are reflected in 
amino acid substitutions of the key amino acid residues in the AR-LBD but may 
also include and is not limited to other deletions or insertions of nucleotides in 
the wild type sequence which may result in changes in the amino acid residues in 
the deduced amino acid sequence of the AR-LBD. The term "mutant" also 

10 includes uncharacterised mutants. 

Preferably the mutated androgen receptor comprises one or more of the 
characterised mutations in the LBP of the AR-LBD as set out in Table 3. 

15 Preferably the mutated amino acid residue(s) is/are located in helices H4 and H5 
of the AR-LBD. 

Preferably the mutated amino acid residue(s) is/are evenly distributed between 
buried, medium and fully accessible amino acid residues within the ligand 
20 binding pocket (LBP) comprising the AR-LBD. 

Preferably the mutated amino acid residue(s) is/are distributed as set out in Figure 
1. 

25 STRUCTURAL CO-ORDINATES 

In a highly preferred embodiment, the crystal has the structural co-ordinates as 
provided in Table 4 (Figure 6) which may be used for the identification of a 
ligand capable of binding to the AR-LBD. 



30 



WO 01/66599 



PCT/IB01/00475 



24 

As used herein, the term "structural co-ordinates" refer to a set of values that 
define the position of one or more amino acid residues with reference to a system 
of axes. The term refers to a data set that defines the three dimensional structure 
of a molecule or molecules (e.g. Cartesian coordinates, temperature factors, and 

5 occupancies). Structural coordinates can be slightly modified and still render 
nearly identical three dimensional structures. A measure of a unique set of 
structural coordinates is the root-mean-square deviation of the resulting structure. 
Structural coordinates that render three dimensional structures (in particular a 
three dimensional structure of an SGC domain) that deviate from one another by 

10 a root-mean-square deviation of less than 5 A, 4 A, 3 A, 2 A, or 1.5 A may be 
viewed by a person of ordinary skill in the art as very similar. 

According to one aspect of the present invention, there is provided a crystal 
comprising a complex between an androgen receptor ligand-binding domain and 
15 a ligand. In other words the androgen receptor ligand binding domain may be 
associated with a ligand in the crystal. The ligand may be any compound which 
is capable of interacting stably and specifically with the androgen receptor ligand 
binding domain. The ligand may, for example, be an inhibitor of the AR-LBD. 

20 LIGAND-BINDING DOMAIN 

As used herein, the term "ligand binding domain (LBD)" means the C-terminal 
ligand binding region of a steroid receptor which is responsible for ligand 
binding. The term "ligand binding domain (LBD)" also includes a homologue of 
25 the ligand binding domain or a portion thereof. The LBD of the present invention 
comprises a ligand binding pocket (LBP).With reference to the crystal of the 
present invention residues in the LBD may be defined by their spatial proximity 
to the ligand in the crystal structure. The term "ligand binding domain (LBD)" 
also includes a homologue of the ligand binding domain or a portion thereof. 

30 
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As used herein, the term "portion thereof means the structural co-ordinates 
corresponding to a sufficient number of amino acid residues of AR-LBD (or 
homologues thereof) that are capable of interacting with a test compound capable 
of binding to the LBD. This term includes AR- ligand binding domain amino 
5 acid residues having an amino acid residues from about 4A to about 5 A of a 
bound compound or fragment thereof. Thus, for example, the structural co- 
ordinates provided in the crystal structure may contain a subset of the amino acid 
residues in the LBD which may be useful in the modelling and design of 
compounds that bind to the LBD. 

10 

The ligand binding domain may be defined by its association with the ligand. 

Preferably the ligand binding domain comprises one or more amino acid residues 
as determined from the crystal structure or a homologue thereof. Examples of 
15 such amino acid residues are presented herein. 

LIGAND BINDING POCKET (LBP) 

According to one aspect of the present invention, there is provided a crystal 
20 structure comprising a ligand binding pocket (LBP); wherein the LBP is defined 
by the following amino acid residue structural co-ordinates: L701; L704; N705; 
L707; Q711; M742; L744; M745; M749; R752; F764; Q783; M787; F876; T877; 
L880; F891; M895; or a homologue thereof. 

25 As used herein, the term "ligand binding pocket (LBP)" refers to the cavity or 
hollow in a structure - typically a three-dimensional (3D) structure - in which a 
ligand binds and in which is located the ligand binding domain (LBD). The LBP 
is sometimes referred to as a "binding niche". In particular, preferaby, the term 
AR-LBP refers to the 18-20 known amino acid residues in the hAR-LBD which 

30 are known to interact with bound ligand (either R1881 or progesterone). These 
residues are highlighted in Figure 1 and included in Figure 4. Most of these 
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residues are hydrophobic and interact mainly with the steroid scaffold, while a 
few are polar and may form hydrogen bonds to the polar atoms in the ligand. 



POLAR AMINO ACIDS 

5 

As used herein, the term "polar" includes positively and negatively charged 
amino acids. In this respect, negatively charged amino acids include aspartic acid 
(D) and glutamic acid (E); positively charged amino acids include lysine (K) and 
arginine (R); and amino acids with uncharged polar head groups having similar 
10 hydrophilicity values include leucine (L), isoleucine (I), valine (V), glycine (G), 
alanine (A), asparagine (N), glutamine (Q), serine (S), threonine (T), 
phenylalanine (F), and tyrosine (Y). The classification of these amino acid 
residues is set out in the Table below. 



15 HOMOLOGUE 



As used herein, the term "homologue" refers to an AR-LBD or a portion thereof 
which may have deletions, insertions or substitutions of amino acid residues as 
long as the binding specificity of the AR-LBD is retained. In this regard, 
20 deliberate amino acid substitutions may be made on the basis of similarity in 
polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the 
amphipathic nature of the residues as long as the binding specificity of the AR- 
LBD is retained. Here, a conservative substitution which may produce a silent 
change which may result in a functionally equivalent AR-LBD. 

25 

As used herein, the term "homologue" also means a homologue of the crystal 
structure of the AR-LBD wherein the homologue has a root mean square (r.m.s) 
deviation from the backbone atoms of amino acid residues in secondary structural 
elements of less than 3.0A. Preferably the r.m.s deviation from the backbone 
30 atoms of amino acid residues in the secondary structural elements is less than 
2.0A. 
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ALIPHATIC 


Non-polar 


GAP 




ILV 




Polar - uncharged 


CSTM 






NQ ! 




Polar - charged 


DE 




KR 


AROMATIC 




HFWY 



Abbreviations for amino acid residues are the standard 3 -letter and/or 1 -letter 
codes used in the art to refer to one of the 20 common L-amino acids. 

5 

SECONDARY STRUCTURE 

The AR-LBD of the present invention is arranged in an a-helical sandwich. The 
AR-LBD comprises preferably eleven a-helices (HI, H3, H4, H5, H6, H7, H8, 

10 H9, H10, HI 1, H12). There is no H2 helix. Because both helices H4 and H5 and 
helices Hll and HI 2 are contiguous helices, the a-helical sandwich is regarded 
as comprising 9 a-helices and not 1 1 a-helices. The a- helices designated by the 
letter H in Figure 1. The helix number (such as HI) is indicated in black above 
the relevant helical sequence. The a-helical sandwich fold may further comprise 

15 preferably 3io helices and preferably four short p strands (SI, S2, S3 and S4) 
associated in two anti-parallel p-sheets. The p strands are indicated by the letter E 
in Figure 1. The strand number (such as SI) is indicated in black above the 
relevant p sheet. 

20 ALPHA HELIX (a-Helix) 

As used herein, the term "a-helix" means a helical or spiral configuration of a 
polypeptide chain in which successive turns of the helix are held together by 
hydrogen bonds between the amide (peptide) links, the carbonyl group of any 
25 given residue being hydrogen-bonded to the imino group of the third residue 
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behind it in the chain. This is the case for all of the caxbonyl and amide groups of 
the peptide bonds of the main chain. Typically, the a-helix has 3, 6 residues per 
turn and the translation or pitch along the helical axis is 1.5A per residue and 
5. 4 A per turn. The helix may be left- or right-handed, the latter being much more 
5 common. The a-helix is one of the two basic elements of the secondary structure 
adopted by the polypeptide chain within the hydrophobic core of a globular 
protein. The other basic element is the {3 strand. 

The AR-LBD of the present invention comprises a helix in the region of helix H6 
10 which is an a-helix. 

The AR-LBD of the present invention comprises contiguous helices. In this 
respect, helices H4 and H5 and helices H10 and HI 1 are contiguous. In contrast 
only the H10 and Hll sequences of the progesterone receptor were found to be 
15 contiguous (see Williams 1998). 

CONTIGUOUS 

As used herein, the term "contiguous helices" means helices which are connected 
20 to each other such as connected in line with each other. 

BETA SHEET (p-SHEET) and BETA STRANDS (p STRANDS) 

As used herein, the term "beta sheet (P -sheet) structure means a combination of 
25 several regions of a polypeptide chain. In contrast, the a helix, is built up from 
one continuous region. These regions, p strands, are usually from 5 to 10 
residues long and are in an almost fully extended conformation with <|), \|/ angles 
within the broad structurally allowed region in the upper left quadrant of the 
Ramachandran plot. These p strands are aligned adjacent to each other such that 
30 hydrogen bonds can form between CO groups of one p strand and NH groups on 
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an adjacent p strand and vice versa. The p sheets that are formed from several 
such p strands are "pleated" with C a atoms successively a little above and below 
the plane of the p sheet. The side chains follow this pattern such that within a p 
strand they also point alternatively above and below the p sheet. 

5 

PARALLEL AND ANTI-PARALLEL p-SHEETS 

P strands can interact in two ways to form a pleated sheet. Either the amino acids 
in the aligned p strands can all run in the same biochemical direction, amino 

10 terminal to carboxy terminal, in which case the sheet is described as parallel, or 
the amino acids in successive strands can have alternating directions, amino 
terminal to carboxy terminal followed by carboxy terminal to amino terminal, 
followed by amino terminal to carboxy terminal, and so on, in which case the 
sheet is called antiparallel. Each of the two forms has a distinctive pattern of 

15 hydrogen bonding. The antiparallel p sheet has narrowly spaced hydrogen bond 
pairs that alternate with widely spaced pairs. Parallel p sheets have evenly 
spaced hydrogen bonds that angle across between the P strands. Within both 
types of p sheets all possible main chain hydrogen bonds are formed, except for 
the two flanking strands of the P sheet that only have one neighboring p strand. 

20 

The AR-LBD of the present invention comprises beta strands (P strands), 
designated by the letter E, which are make up sheets. These strands (SI, S2, S3 
and S4) are arranged in the order in which they appear in the secondary structure 
as set out in Figure 1. These strands are arranged in two p-sheets. 

25 

KEY RESIDUES 

As used herein the term "key residues" refers to one or more amino acid residues 
in an AR-LBD, capable of modulating ligand binding. The residues may be any 
30 one of the key residues within the AR-LBD as described herein or mutants 
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thereof or they may be residues with homology to the residues or mutants thereof. 
The key amino acid residues of the AR-LBD may be any one or more of the 
amino acid residues selected from the group consisting of: L701; L704; N705; 
L707; Q711; M742; L744; M745; M749; R752; F764; Q783; M787; F876; T877; 
5 L880; F89 1 ; M895 or a homologue or mutant thereof. 

CONFORMATIONALLY CONSTRAINED RESIDUES 

Preferably binding of the ligand to the AR-LBD causes conformational changes 
10 to the AR-LBD thereby inhibiting further binding thereto. 

Preferably the ligand produced in accordance with the invention fills at least the 
LBP of the AR without perturbing the remainder of the AR structure. 

Preferably the ligand interacts with conformationally constrained residue of the 
15 AR-LBD. 

As used herein, the term "conformationally constrained residue" refers to a 
residue, such as an amino acid residue whose binding properties may be 
modulated through a mutation in that residue. The mutation in the amino acid 
residue may result in a change in the conformation of that residue. In particular, 
20 the mutation may result in a restricted/constrained conformation which may 
affect the interaction of a ligand with the hAR-LBD. 

BINDING AFFINITY 

25 Preferably the ligands of the present invention bind more effectively to the AR- 
LBD than androgen. 

Preferably the ligands of the present invention bind with twice the binding 
affinity of androgen. 

30 
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Preferably the ligands of the present invention bind with three times the affinity 
of androgen. 

Preferably the ligands of the present invention bind with ten or more times the 
5 affinity of androgen. 

Preferably the improvements in the interaction of a ligand with the AR-LBD are 
manifested as increases in binding affinity but may also include increases in 
receptor selectivity and/or modulation of efficacy. 

10 

Preferably the ligand inhibits the action of androgen and androgen mimetics by 
binding tightly to the AR-LBD but by not up-regulating androgen receptor gene 
expression. 

15 MODEL 

One aspect of the present invention is related to a model. 

The crystal structure of the present invention can be used to generate a structural 
model such as a three dimensional (3D) structural model (or a representation 
thereof) comprising an AR-LBD or portion thereof. Alternatively, the crystal 
20 structure may be used to generate a computer model for the structure. 

Preferably the crystal model comprising the AR-LBD is built from all or part of 
the X-ray diffraction data presented in Table 1 and/or the refinement statistics 
presented in Table 2. 

25 

Preferably the crystal model comprising the AR-LBD is built from all or part of 
the crystal co-ordinate data as shown in Table 4 (see Figure 6). 

Thus, for example, the structural co-ordinates provided in the crystal. structure 
30 and/or model structure may comprise the amino acid residues of the AR-LBD, or 
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a portion of the AR-LBD or a homologue thereof useful in the modelling and 
design of test compounds capable of binding to the AR-LBD. 

As used herein, the term "modelling" includes the quantitative and qualitative 
5 analysis of molecular structure and/or function based on atomic structural 
information and interaction models. The term "modelling" includes conventional 
numeric-based molecular dynamic and energy minimization models, interactive 
computer graphic models, modified molecular mechanics models, distance 
geometry and other structure-based constraint models. 

10 

In another aspect of the present invention, the structural coordinates comprising 
the AR-LBD or a portion thereof may be applied to a model screening system. 

As used herein, the term "model screening system" may be a solid 3D screening 
15 system or a computational screening system. Using this model, Test compounds 
can be modelled that fit spatially and preferentially into the AR-LBD. 

In one preferred aspect, the test compounds are positioned in the AR-IBD 
through computational docking. 

20 

In another preferred aspect, the test compounds are positioned in the AR-BD 
through manual docking. 

As used herein, the term "fits spatially" means that the three-dimensional 
25 structure of a ligand is accommodated geometrically in a cavity or pocket of an 
AR-IBD. 

Preferably, modelling is performed using a computer and may be further 
optimized using known methods. This is called modelling optimisation. Overlays 
30 and super positioning with a three dimensional model of the AR-LBD, and/or a 
portion thereof, can also be used for modelling optimisation. 
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Alignment and/or modelling can be used as a guide for the placement of 
mutations on the AR-LBD surface to characterise the nature of the site in the 
context of a cell. 

5 

The structure coordinates of an AR-LBD structure described herein can be used 
as a model for determining the secondary or three-dimensional structures of 
additional native or mutated AR-LBD with unknown structure, as well as the 
structures of co-crystals of AR-LBD with compounds such as substrates and 
10 modulators (e.g. stimulators or inhibitors). The structure coordinates and models 
of an AR-LBD structure can also be used to determine solution-based structures 
of native or mutant AR-LBD. 

Secondary or three-dimensional structure may be determined by applying the 
15 structural coordinates of an AR-LBD structure to other data such as an amino 
acid sequence, X-ray crystallographic diffraction data, or nuclear magnetic 
resonance (NMR) data. Homology modeling, molecular replacement, and nuclear 
magnetic resonance methods using these other data sets are described below. 

20 Homology modeling (also known as comparative modeling or knowledge-based 
modeling) methods develop a three dimensional model from a polypeptide 
sequence based on the structures of known proteins (e.g. native or mutated AR- 
LBD). In the present invention the method utilizes a computer representation of 
an AR-LBD structure or a complex of same, a computer representation of the 

25 amino acid sequence of a polypeptide with an unknown structure (additional 
native or mutated AR-LBD), and standard computer representations of the 
structures of amino acids. The method in particular comprises the steps of; (a) 
identifying structurally conserved and variable regions in the known structure; (b) 
aligning the amino acid sequences of the known structure and unknown structure 

30 (c) generating coordinates of main chain atoms and side chain atoms in 
structurally conserved and variable regions of the unknown structure based on the 
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coordinates of the known structure thereby obtaining a homology model; and (d) 
refining the homology model to obtain a three dimensional structure for the 
unknown structure. This method is well known to those skilled in the art (Greer, 
1985, Science 228, 1055; Bundell et al 1988, Eur. J. Biochem. 172, 513; 
5 Knighton et al., 1992, Science 258:130-135, 

http://biochem.vt.edu/courses/modeling/homology.htn). Computer programs that 
can be used in homology modeling are Quanta and the Homology module in the 
Insight II modeling package distributed by Molecular Simulations Inc, or 
MODELLER (Rockefeller University, www.iucr.ac;uk/sinris-top/loRical/prg- 
10 modeller ..html) . 

In step (a) of the homology modeling method, the known AR-LBD structure is 
examined to identify the structurally conserved regions (SCRs) from which an 
average structure, or framework, can be constructed for these regions of the 
15 protein. Variable regions (VRs), in which known structures may differ in 
conformation, also must be identified. SCRs generally correspond to the elements 
of secondary structure, such as alpha-helices and beta-sheets, and to ligand- and 
substrate-binding sites (e.g. acceptor and donor binding sites). The VRs usually 
lie on the surface of the proteins and form the loops where the main chain turns. 

20 

Many methods are available for sequence alignment of known structures and 
unknown structures. Sequence alignments generally are based on the dynamic 
programming algorithm of Needleman and Wunsch [J. Mol. Biol. 48: 442-453, 
1970]. Current methods include FASTA, Smith- Waterman, and BLASTP, with 

25 the BLASTP method differing from the other two in not allowing gaps. Scoring 
of alignments typically involves construction of a 20x20 matrix in which 
identical amino acids and those of similar character (i.e., conservative 
substitutions) may be scored higher than those of different character. 
Substitution schemes which may be used to score alignments include the scoring 

30 matrices PAM (Dayhoff et al., Meth. Enzymol. 91: 524-545, 1983), and 
BLOSUM (Henikoff and Henikoff, Proc. Nat. Acad. Sci. USA 89: 10915-^0919, 
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1992), and the matrices based on alignments derived from three-dimensional 
structures including that of Johnson and Overington (JO matrices) (J. Mol. Biol. 
233: 716-738, 1993). 

5 Alignment based solely on sequence may be used; however, other structural 
features also may be taken into account. In Quanta, multiple sequence alignment 
algorithms are available that may be used when aligning a sequence of the 
unknown with the known structures. Four scoring systems (i.e. sequence 
homology, secondary structure homology, residue accessibility homology, CA- 
10 CA distance homology) are available, each of which may be evaluated during an 
alignment so that relative statistical weights may be assigned. 

When generating coordinates for the unknown structure, main chain atoms and 
side chain atoms, both in SCRs and VRs need to be modeled. A variety of 

15 approaches known to those skilled in the art may be used to assign coordinates to 
the unknown. In particular, the coordinates of the main chain atoms of SCRs will 
be transferred to the unknown structure. VRs correspond most often to the loops 
on the surface of the polypeptide and if a loop in the known structure is a good 
model for the unknown, then the main chain coordinates of the known structure 

20 may be copied. Side chain coordinates of SCRs and VRs are copied if the residue 
type in the unknown is identical to or very similar to that in the known structure. 
For other side chain coordinates, a side chain rotamer library may be used to 
define the side chain coordinates. When a good model for a loop cannot be found 
fragment databases may be searched for loops in other proteins that may provide 

25 a suitable model for the unknown. If desired, the loop may then be subjected to 
conformational searching to identify low energy conformers if desired. 

Once a homology model has been generated it is analyzed to determine its 
correctness. A computer program available to assist in this analysis is the Protein 
30 Health module in Quanta which provides a variety of tests. Other programs that 
provide structure analysis along with output include PROCHECK and 3D- 
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Profiler [Luthy R. et al, Nature 356: 83-85, 1992; and Bowie, J.U. et al, Science 
253: 164-170, 1991]. Once any irregularities have been resolved, the entire 
structure may be further refined. Refinement may consist of energy minimization 
with restraints, especially for the SCRs. Restraints may be gradually removed for 
5 subsequent minimizations. Molecular dynamics may also be applied in 
conjunction with energy minimization. 

Using the structure coordinates of the crystal complexes provided by this 
invention, molecular replacement may be used to determine the structure 
10 coordinates of a crystalline mutant or homologue of AR-LBD or of a related 
protein. 

Molecular replacement involves applying a known structure to solve the X-ray 
crystallographic data set of a polypeptide of unknown structure (e.g. native or 

15 mutated AR-LBD). The method can be used to define the phases describing the 
X-ray diffraction data of a polypeptide of unknown structure when only the 
amplitudes are known. Commonly used computer software packages for 
molecular replacement are X-PLOR (Brunger 1992, Nature 355: 472-475), 
AMoRE (Navaza, 1994, Acta Crystallogr. A50:157-163), the CCP4 package 

20 (Collaborative Computational Project, Number 4, "The CCP4 Suite: Programs for 
Protein Crystallography", Acta Cryst, Vol. D50, pp. 760-763, 1994), and the 
MERLOT package (P.M.D. Fitzgerald, J. Appl. Cryst, Vol. 21, pp. 273-278, 
1988). It is preferable that the resulting structure not exhibit a root-mean-square 
deviation of more than 3 A. 

25 

Molecular replacement computer programs generally involve the following steps: 
(1) determining the number of molecules in the unit cell and defining the angles 
between them (self rotation function); (2) rotating the known structure (e.g. AR- 
LBD) against diffraction data to define the orientation of the molecules in the unit 
30 cell (rotation function); (3) translating the known structure in three dimensions to 
correctly position the molecules in the unit cell (translation function); (4) 
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determining the phases of the X-ray diffraction data and calculating an R-factor 
calculated from the reference data set and from the new data wherein an R-factor 
between 30-50% indicates that the orientations of the atoms in the unit cell have 
been reasonably determined by the method; and (5) optionally, decreasing the R- 
5 factor to about 20% by refining the new electron density map using iterative 
refinement techniques known to those skilled in the art (refinement). 

In an embodiment of the invention, a method is provided for determining three 
dimensional structures of polypeptides with unknown structure (e.g. additional 
10 native or mutated AR-LBD) by applying the structural coordinates of an AR- 
LBD structure to provide an X-ray crystallographic data set for a polypeptide of 
unknown structure, and (b) determining a low energy conformation of the 
resulting structure. 

15 The structural coordinates of an AR-LBD structure may be applied to nuclear 
magnetic resonance (NMR) data to determine the three dimensional structures of 
polypeptides (e.g. additional native or mutated AR-LBD). (See for example, 
Wuthrich, 1986, John Wiley and Sons, New York: 176-199; Pflugrath et al., 
1986, J. Molecular Biology 189: 383-386; Kline et al., 1986 J. Molecular Biology 

20 1 89:377-382). While the secondary structure of a polypeptide may often be 
determined by NMR data, the spatial connections between individual pieces of 
secondary structure are not as readily determined. The structural coordinates of a 
polypeptide defined by X-ray crystallography can guide the NMR spectroscopist 
to an understanding of the spatial interactions between secondary structural 

25 elements in a polypeptide of related structure. Information on spatial interactions 
between secondary structural elements can greatly simplify Nuclear Overhauser 
Effect (NOE) data from two-dimensional NMR experiments. In addition, 
applying the structural coordinates after the determination of secondary structure 
by NMR techniques simplifies the assignment of NOE's relating to particular 

30 amino acids in the polypeptide sequence and does not greatly bias the NMR 
analysis of polypeptide structure. 
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This, in turn, can be subject to any of the several forms of refinement to provide a 
final, accurate structure of the unknown crystal. Lattman, E., "Use of the Rotation 
and Translation Functions", in Methods in Enzymology, 115, pp. 55-77 (1985); 
5 M. G. Rossmann, ed., "The Molecular Replacement Method", Int. Sci. Rev. Ser., 
No. 13, Gordon & Breach, New York, (1972). 

Other molecular modelling techniques may also be employed in accordance with 
this invention. See, e.g., Cohen, N. C. et al 9 "Molecular Modelling Software and 
10 Methods for Medicinal Chemistry", J. Med. Chem., 33, pp. 883-894 (1990). See 
also, Navia, M. A. and M. A. Murcko, "The Use of Structural Information in 
Drug Design", Current Opinions in Structural Biology, 2, pp. 202-210 (1992). 

The present invention also relates to a method of screening for a ligand capable 
15 of binding to the AR-LBD and/or which are capable of modulating the binding 
capacity of the AR-LBD wherein said method comprises the use of the crystal or 
model according to the invention. 

The method may employ a solid 3D screening system or a computational 
screening system. Using these systems, test compounds may be screened to find 
20 those which interact spatially and preferentially with the AR-LBD, through either 
computational or manual docking. 

TEST COMPOUNDS 

25 In one aspect, the invention relates to a method of screening for a ligand capable 
of binding to an AR-LBD, wherein the AR-LBD is defined by the amino acid 
residue structural coordinates given above, the method comprising contacting the 
AR-LBD with a test compound and determining if said test compound binds to 
said AR-LBD. 

30 
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As used herein, the term "test compound" includes, but is not limited to, a 
compound which may be obtainable from or produced by any suitable source, 
whether natural or not. The test compound may be designed or obtained from a 
library of compounds which may comprise peptides, as well as other compounds, 

5 such as small organic molecules and particularly new lead compounds. By way 
of example, the test compound may be a natural substance, a biological 
macromolecule, or an extract made from biological materials such as bacteria, 
fungi, or animal (particularly mammalian) cells or tissues, an organic or an 
inorganic molecule, a synthetic test compound, a semi-synthetic test compound, a 

10 structural or functional mimetic, a peptide, a peptidomimetics, a derivatised test 
compound, a peptide cleaved from a whole protein, or a peptides synthesised 
synthetically (such as, by way of example, either using a peptide synthesizer or 
by recombinant techniques or combinations thereof, a recombinant test 
compound, a natural or a non-natural test compound, a fusion protein or 

15 equivalent thereof and mutants, derivatives or combinations thereof. 

MODULATING 

The term "modulating" means inducing an increase or a decrease in the activity 
20 of the androgen receptor through binding of a test compound to an AR-LBD. 
The term also encompasses removal of the activity of the receptor. 

MIMETIC 

25 As used herein, the term "mimetic" relates to any chemical which includes, but is 
not limited to, a peptide, polypeptide, antibody or other organic chemical which 
has the same qualitative activity or effect as a known test compound. That is, the 
mimetic is a functional equivalent of a known test compound (such as a known 
ligand capable of binding to the AR-LBD). 

30 
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DERIVATIVE 

The term "derivative" or "derivatised" as used herein includes chemical 
modification of a test compound. Illustrative of such chemical modifications 
5 would be replacement of hydrogen by a halo group, an alkyl group, an acyl group 
or an amino group. 

Typically the test compound will be prepared by recombinant DNA techniques 
and/or chemical synthesis techniques. 

10 

Once a test compound capable of interacting with a key amino acid residue in the 
AR-LBD has been identified, further steps may be carried out either to select 
and/or to modify compounds and/or to modify existing compounds, to modulate 
the interaction with the key amino acid residues in the AR-LBD. 

15 

BIOLOGICAL SCREENS 

Test compounds and ligands which are identified with the crystal of the present 
invention can be screened in assays such as are well known in the art. Screening 

20 can be, for example in vitro, in cell culture, and/or in vivo. Biological screening 
assays preferably center on activity-based response models, binding assays 
(which measure how well a compound binds to the receptor), and bacterial, yeast 
and animal cell lines (which measure the biological effect of a compound in a 
cell). The assays can be automated for high capacity-high throughput screening 

25 (HTS) in which large numbers of compounds can be tested to identify 
compounds with the desired activity. The biological assay, may also be an assay 
for ligand binding activity a compound that selectively binds to the LBD 
compared to other nuclear receptors. 
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In one embodiment, the present invention provides a method of screening for a 
test compound capable of interacting with a key amino acid residue of the AR~ 
LBD . 



5 Another preferred aspect of the invention provides a process comprising the steps 
of: 

(a) performing the method of screening for a ligand as described above; 

(b) identifying one or more ligands capable of binding to a ligand binding 
domain; and 

10 (c) preparing a quantity of said one or more ligands. 

A further preferred aspect of the invention provides a process comprising the 
steps of: 

(a) performing the method of screening for a ligand as described above; 
15 (b) identifying one or more ligands capable of binding to an AR-LBD; and 

(c) preparing a pharmaceutical composition comprising said one or more 
ligands. 



Yet another preferred aspect of the invention provides a process comprising the 
20 steps of: 

(a) performing the method of screening for a ligand as described above; 

(b) identifying one or more ligands capable of binding to an AR-LBD; 

(c) modifying said one or more ligands capable of binding to an AR-LBD; 

(d) performing said method of screening for a ligand as described above; 

25 (e) optionally preparing a pharmaceutical composition comprising said one or 
more ligands. 



Thus, the structural information from the crystal structure of the present invention 
is useful in the design of potential ligands capable of interacting with the AR- 
30 LBD and/or capable of modulating the DNA binding capacity of the AR-LBD, and 
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the models of the present invention are useful to examine the effect such a ligand 
is likely to have on the structure and/or function of the AR-LBD. 

In one aspect the present invention relates to a ligand identified using such 
5 screening methods. 

LIGAND 

As used herein, the term "ligand" refers to a test compound capable of binding to 
10 one or more key residues in the LBD. Such a ligand may also be referred to as an 
androgen receptor binding compound. Preferably the ligand is capable of 
modulating the activity of AR-LBD. 

IDENTIFICATION OF MODULATORS OF AR-LBD 

15 

Modulators (e.g. inhibitors) of a AR-LBD may be designed and identified that 
may modify a AR-LBD involved in a clinical disorder. The rational design and 
identification of modulators of AR-LBD can be accomplished by utilizing the 
atomic structural coordinates that define an AR-LBD structure, or a part thereof. 

20 Structure-based modulator design identification methods are powerful techniques 
that can involve searches of computer data bases containing a variety of potential 
modulators and chemical functional groups. (SeeKuntz et al., 1994, Acc. Chem. 
Res. 27:117; Guida, 1994, Current Opinion in Struc. Biol. 4: 777; and Colman, 
1994, Current Opinion in Struc. Biol. 4: 868, for reviews of structure-based drug 

25 design and identification;and Kuntz et al 1982, J. Mol. Biol. 162:269; Kuntz et 
al., 1994, Acc. Chem. Res. 27: 117; Meng et aL, 1992, J. Compt. Chem. 13: 505; 
Bohm, 1994, J. Comp. Aided Molec. Design 8: 623 for methods of structure- 
based modulator design). 

30 The AR-LBD structures, and parts thereof described herein, and the structures of 
other polypeptides determined by the homology modeling, molecular 
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replacement, and NMR techniques described herein can also be applied to 
modulator design and identification methods. 

Modulators of AR-LBD may be identified by docking the computer 
5 representation of compounds from a data base of molecules. Data bases which 
may be used include ACD (Molecular Designs Limited), NCI (National Cancer 
Institute), CCDC (Cambridge Crystallographic Data Center), CAST (Chemical 
Abstract Service), Derwent (Derwent Information Limited), Maybridge 
(Maybridge Chemical Company Ltd), Aldrich (Aldrich Chemical Company), 
10 DOCK (University of California in San Francisco), and the Directory of Natural 
Products (Chapman & Hall). Computer programs such as CONCORD (Tripos 
Associates) or DB-Converter (Molecular Simulations Limited) can be used to 
convert a data set represented in two dimensions to one represented in three 
dimensions. 

15 

The computer programs may comprise the following steps: 

(a) docking a computer representation of a structure of a compound into a 
computer representation of an AR-LBD defined in accordance with the 
invention using the computer program, or by interactively moving the 

20 representation of the compound into the representation of the binding site; 

(b) characterizing the geometry and the complementary interactions formed 
between the atoms of the binding site and the compound; optionally 

(c) searching libraries for molecular fragments which can fit into the empty 
space between the compound and binding site and can be linked to the 

25 compound; and 

(d) linking the fragments found in (c) to the compound and evaluating the 
new modified compound. 

Methods are also provided for identifying a potential modulator of an AR-LBD 
30 function by docking a computer representation of a compound with a computer 
representation of a structure of an AR-LBD that is defined by atomic interactions, 
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atomic contacts, or atomic structural coordinates described herein. In an 
embodiment the method comprises the following steps: 

(a) docking a computer representation of a compound from a computer data 
base with a computer representation of a selected site (e.g. the inhibitor 

5 binding site) on a AR-LBD structure defined in accordance with the 

invention to obtain a complex; 

(b) determining a conformation of the complex with a favourable geometric 
fit and favourable complementary interactions; and 

(c) identifying compounds that best fit the selected site as potential 
10 modulators of the AR-LBD. 



"Docking" refers to a process of placing a compound in close proximity with an 
active site of a polypeptide (i.e. an AR-LBD), or a process of finding low energy 
conformations of a compound/polypeptide complex (i.e. compound/AR-LBD 
15 complex). 

Examples of other computer programs that may be used for structure-based 
modulator design are CAVEAT (Bartlett et al., 1989, in "Chemical and 
Biological Problems in Molecular Recognition", Roberts, S.M. Ley, S.V.; 
20 Campbell, N.M. eds; Royal Society of Chemistry: Cambridge, pp 182-196); 
FLOG (Miller et al., 1994, J. Comp. Aided Molec. Design 8:153); PRO 
Modulator (Clark et al, 1995 J. Comp. Aided Molec. Design 9:13); MCSS 
(Miranker and Karplus, 1991, Proteins: Structure, Fuction, and Genetics 8:195); 
and, GRID (Goodford, 1985, J. Med. Chem. 28:849). 

25 

In an embodiment of the invention, a method is provided for identifying potential 
modulators of AR-LBD function. The method utilizes the structural coordinates 
of an AR-LBD three dimensional structure, or binding site thereof. The method 
comprises the steps of (a) generating a computer representation of an AR-LBD 
30 structure, and docking a computer representation of a compound from a computer 
data base with a computer representation of the AR-LBD to form a complex; (b) 
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determining a conformation of the complex with a favourable geometric fit or 
favorable complementary interactions; and (c) identifying compounds that best fit 
the AR-LBD as potential modulators of AR-LBD function. The initial AR-LBD 
structure may or may not have compounds bound to it. A favourable geometric fit 
5 occurs when the surface areas of a compound in a compound-AR-LBD complex 
is in close proximity with the surface area of the AR-LBD without forming 
unfavorable interactions. A favourable complementary interaction occurs where a 
compound in a compound-AR-LBD complex interacts by hydrophobic, aromatic, 
ionic, or hydrogen donating and accepting forces, with the AR-LBD without 
10 forming unfavorable interactions. Unfavourable interactions may be steric 
hindrance between atoms in the compound and atoms in the AR-LBD. 

In another embodiment, potential modulators are identified utilizing an AR-LBD 
structure with or without compounds bound to it. The method comprises the steps 

15 of (a) modifying a computer representation of an AR-LBD having one or more 
compounds bound to it, where the computer representations of the compound or 
compounds and AR-LBD are defined by atomic structural coordinates; (b) 
determining a conformation of the complex with a favorable geometric fit and 
favorable complementary interactions; and (c) identifying the compounds that 

20 best fit the AR-LBD as potential modulators. A computer representation may be 
modified by deleting or adding a chemical group or groups. Computer 
representations of the chemical groups can be selected from a computer database. 

Another way of identifying potential modulators is to modify an existing 
25 modulator in a polypeptide binding site. The computer representation of 
modulators can be modified within the computer representation of an AR-LBD. 
This technique is described in detail in Molecular Simulations User Manual, 1995 
in LUDI. The computer representation of a modulator may be modified by 
deleting a chemical group or groups, or by adding a chemical group or groups. 
30 After each modification to a compound, the atoms of the modified compound and 
binding site can be shifted in conformation and the distance between the 
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modulator and the binding site atoms may be scored on the basis of geometric fit 
and favourable complementary interactions between the molecules. Compounds 
with favourable scores are potential modulators. 



5 Compounds designed by modulator building or modulator searching computer 
programs may be screened to identify potential modulators. Examples of such 
computer programs include programs in the Molecular Simulations Package 
(Catalyst), ISIS/HOST, ISIS/BASE, and ISIS/DRAW (Molecular Designs 
Limited), and UNITY (Tripos Associates). A building program may be used to 

10 replace computer representations of chemical groups in a compound complexed 
with an AR-LBD with groups from a computer database. A searching program 
may be used to search computer representations of compounds from a computer 
database that have similar three dimensional structures and similar chemical 
groups as a compound that binds to an AR-LBD. The programs may be operated 

15 on the structure of the AR-LBD structure. 



A typical program may comprise the following steps: 

(a) mapping chemical features of a compound such as by hydrogen bond 
donors or acceptors, hydrophobic/lipophilic sites, positively ionizable 

20 sites, or negatively ionizable sites; 

(b) adding geometric constraints to selected mapped features; 

(c) searching data bases with the model generated in (b). 



In an embodiment of the invention a method of identifying potential modulators 
25 of an AR-LBD is provided using the three dimensional conformation of the AR- 
LBD in various modulator construction or modulator searching computer 
programs on compounds complexed with the AR-LBD. The method comprises 
the steps of (a) generating a computer representation of one or more compounds 
complexed with an AR-LBD; (b) (i) searching a data base for a compound with a 
30 similar geometric structure or similar chemical groups to the generated 
compounds using a computer program that searches computer representations of 
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compounds from a database that have similar three dimensional structures and 
similar chemical groups, or (ii) replacing portions of the compounds complexed 
with the AR-LBD with similar chemical structures (i.e. nearly identical shape and 
volume) from a database using a compound construction computer program that 
5 replaces computer representations of chemical groups with groups from a 
computer database, where the representations of the compounds are defined by 
structural coordinates. 

A compound that interacts with an AR-LBD identified using a method of the 
10 invention may be used as a modulator of any AR-LBD or composition bearing 
the interacting binding domain. Therefore, the invention features a modulator of 
an AR-LBD identified by a method of the invention. 

The invention further contemplates a method for designing potential inhibitors of 
15 an AR-LBD comprising the step of using the structural coordinates of an 
inhibitor or substrate or parts thereof, defined in relation to its spatial association 
with an AR-LBD structure to generate a compound that is capable of associating 
with the AR-LBD. 

20 In an embodiment of the invention, a method is provided for designing potential 
inhibitors of an AR-LBD comprising the step of using the structural coordinates 
of AR-LBD in Table 4 to generate a compound for associating with the active 
site of an AR-LBD. The following steps are employed in a particular method of 
the invention: (a) generating a computer representation of AR-LBD defined by its 

25 structural coordinates listed in Table 4; (b) searching for molecules in a data base 
that are structurally or chemically similar to the defined AR-LBD, using a 
searching computer program, or replacing portions of the compound with similar 
chemical structures from a database using a compound building computer 
program. 



30 
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It will be appreciated that a modulator of an AR-LBD may be identified by 
generating an actual three-dimensional model of a binding cavity, synthesizing a 
compound, and examining the components to find whether the required 
interaction occurs. 

5 

Potential modulators of AR-LBD identified using the above-described methods 
may be prepared using methods described in standard reference sources utilized 
by those skilled in the art. For example, organic compounds may be prepared by 
organic synthetic methods described in references such as March, 1994, 
10 Advanced Organic Chemistry: Reactions, Mechanisms, and Structure, New York, 
McGraw Hill. 



The invention also relates to a potential modulator identified by the methods of 
the invention. In particular, classes of modulators of AR-LBD are provided that 
15 are based on the three-dimensional structure of an inhibitor's or modulator's 
spatial association with an AR-LBD structure. 

The invention contemplates all optical isomers and racemic forms of the 
modulators of the invention. 

20 

"Modulator" refers to a molecule which changes or alters the biological activity 
of a AR-LBD. A modulator may increase or decrease AR-LBD activity, or 
change its characteristics, or functional or immunological properties. It may be an 
inhibitor that decreases the biological or immunological activity of the protein. A 
25 modulator may enhance or inhibit a biological activity of AR-LBD . 



Modulators include but are not limited to peptides, members of random peptide 
libraries and combinatorial chemistry-derived molecular libraries, 
phosphopeptides (including members of random or partially degenerate, directed 
30 phosphopeptide libraries), antibodies, carbohydrates, nucleosides or nucleotides 
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or parts thereof, and small organic or inorganic molecules. A modulator may be 
an endogenous physiological compound, or it may be a natural or synthetic 
compound. 

5 LIGAND 

The term "ligand" includes, but is not limited to, steroidal and non-steroidal 
ligands. The ligands may be natural or synthetic. The ligands may be 
structurally novel AR-LBD ligands. Alternatively, the ligands may be analogues 

10 of known AR-LBD ligands but with improved properties. The ligand may be an 
androgen mimetic. The ligand may be capable of modulating (e.g. upregulating) 
androgen receptor gene expression. Alternatively, the ligand may be capable of 
blocking the activity of androgens by binding to an AR-LBD with a high affinity. 
The ligand may be capable of down regulating androgen receptor gene 

15 expression. The term "ligand" also refers to a chemically modified ligand. 

The ligand may act, for example, as an agonist, a partial agonist, an antagonist, 
and/or a competitive antagonist of the androgen receptor. 

20 For some embodiments, the ligand is in a purified and/or isolated form. 

DESIGNER LIGANDS 

As used herein, the term means "designer ligands" refers to test compounds 
25 which are likely to bind to the AR-LBD based on their three dimensional shape 
compared to that of the androgen receptor and in particular the AR-LBD. 

Preferably, those compounds have a structure which is complementary to that of 
the AR- 
30 LBD. 
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Preferably the ligands comprise ligand substituents which compensate for the 
structural changes in the ligand binding pocket (LBP) between the wild type and 
mutant AR-LBDs. 

5 

The test compound may be tested for its interaction with an interacting amino 
acid residue in the AR-LBD. Alternatively, the test compound may affect ligand 
binding by acting either as an agonists or an antagonists. 

10 AGONIST 

As used herein, the term "agonist" means any ligand, which is capable of binding 
to an AR-LBD and which is capable of increasing a proportion of the AR that is 
in an active form, resulting in an increased biological response. The term 
15 includes partial agonists and inverse agonists. 

PARTIAL AGONIST 

As used herein, the term "partial agonist" means an agonist that is unable to 
20 evoke the maximal response of a biological system, even at a concentration 
sufficient to saturate the specific receptors. 

INVERSE AGONIST 

25 As used herein, the term "partial inverse agonist" is an inverse agonist that 
evokes a submaximal response to a biological system, even at a concentration 
sufficient to saturate the specific receptors. At high concentrations, it will 
diminish the actions of a full inverse agonist. 
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ANTAGONIST 

As used herein, the term "antagonist" means any agent that reduces the action of 
another agent, such as an agonist. The antagonist may act at the same receptor as 
5 the agonist. The antagonistic action may result from a combination of the 
substance being antagonised (chemical antagonism) or the production of an 
opposite effect through a different receptor (functional antagonism or 
physiological antagonism) or as a consequence of competition for the binding site 
of an intermediate that links receptor activation to the effect observed (indirect 
10 antagonism). 

COMPETITIVE ANTAGONIST 

As used herein, the term "competitive antagonism" refers to the competition 
15 between an agonist and an antagonist for a receptor that occurs when the binding 
of agonist and antagonist becomes mutually exclusive. This may be because the 
agonist and antagonist compete for the same binding site or combine with 
adjacent but overlapping sites. A third possibility is that different sites are 
involved but that they influence the receptor macromolecules in such a way that 
20 agonist and antagonist molecules cannot be bound at the same time. If the 
agonist and antagonist form only short lived combinations with the receptor so 
that equilibrium between agonist, antagonist and receptor is reached during the 
presence of the agonist, the antagonism will be surmountable over a wide range 
of concentrations. In contrast, some antagonists, when in close enough proximity 
25 to their binding site, may form a stable covalent bond with it and the antagonism 
becomes insurmountable when no spare receptors remain. 

In one aspect, the identified ligand may act as a ligand model (for example, a 
template) for the development of other compounds. 

30 
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LIGAND MODEL 

The term "ligand model" refers to the structural coordinates of a compound that 
fits into the AR-ligand binding domain (LBD) and which may be used for 
5 modeling to identify and/or design ligands (designer ligands) capable of binding 
to the AR-LBD, such as for the subsequent modulation thereof. 

One skilled in the art may use one of several methods to test compounds for their 
ability to associate with AR-LBD. This process may begin by visual inspection 

10 of, for example, a target site on the computer screen based on the structure 
coordinates given in Table 4. Selected test compounds may then be positioned in 
a variety of orientations, or docked, within an individual target site of AR-LBD 
as defined supra. Docking may be accomplished using software such as Quanta 
and Sybyl, followed by energy minimization and molecular dynamics with 

1 5 standard molecular mechanics forcefields, such as CHARMM and AMBER. 

Specialized computer programs may also assist in the process of selecting 
potential ligands. These include:. 

1. GRID (Goodford, P. J., "A Computational Procedure for Determining 
Energetically Favorable Binding Sites on Biologically Important 

20 Macromolecules", J. Med. Chem., 28, pp. 849-857 (1985)). GRID is available 
from Oxford University, Oxford, UK. 

2. MCSS (Miranker, A. and M. Karplus, "Functionality Maps of Binding 
Sites: A Multiple Copy Simultaneous Search Method." Proteins: Structure. 
Function and Genetics, 11, pp. 29-34 (1991)). MCSS is available from Molecular 

25 Simulations, Burlington, Mass. 

3. AUTODOCK (Goodsell, D. S. and A. J. Olsen, "Automated Docking 
of Substrates to Proteins by Simulated Annealing", Proteins: Structure. Function, 
and Genetics, 8, pp. 195-202 (1990)). AUTODOCK is available from Scripps 
Research Institute, La Jolla, Calif. 
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4. DOCK (Kuntz, L D. et al, "A Geometric Approach to Macromolecule- 
Ligand Interactions", J. Mol. Biol., 161, pp. 269-288 (1982)). DOCK is available 
from University of California, San Francisco, Calif. 

5 Once a ligand has been optimally selected or designed, substitutions may then be 
made in some of its atoms or side groups in order to improve or modify its 
binding properties. Generally, initial substitutions are conservative, i.e., the 
replacement group will have approximately the same size, shape, hydrophobicity 
and charge as the original group. It should, of course, be understood that 
10 components known in the art to alter conformation should be avoided. Such 
substituted chemical compounds may then be analyzed for efficiency of fit to 
AR-LBD by the same computer methods described above. 

Preferably, positions for substitution are selected based on the predicted binding 
15 orientation of a test compound to the AR-LBD. 

The ligands of the present invention may be natural or synthetic. The term 
"ligand" also refers to a chemically modified ligand. 

20 SYNTHESIS METHODS 

The ligand of the present invention or mimetics thereof may be produced using 
chemical methods to synthesize the ligand in whole or in part. For example, 
peptides can be synthesized by solid phase techniques, cleaved from the resin, 
25 and purified by preparative high performance liquid chromatography (e.g., 
Creighton (1983) Proteins Structures And Molecular Principles, WH Freeman 
and Co, New York NY). The composition of the synthetic peptides may be 
confirmed by amino acid analysis or sequencing (e.g., the Edman degradation 
procedure; Creighton, supra). 

30 
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Direct synthesis of the ligand or mimetics thereof can be performed using various 
solid-phase techniques (Roberge JY et al (1995) Science 269: 202-204) and 
automated synthesis may be achieved, for example, using the ABI 43 1 A Peptide 
Synthesizer (Perkin Elmer) in accordance with the instructions provided by the 
5 manufacturer. Additionally, the amino acid sequences obtainable from the 
ligand, or any part thereof, may be altered during direct synthesis and/or 
combined using chemical methods with a sequence from other subunits, or any 
part thereof, to produce a variant ligand. 

10 In an alternative embodiment of the invention, the coding sequence of the ligand 
or mimetics thereof may be synthesized, in whole or in part, using chemical 
methods well known in the art (see Caruthers MH et al (1980) Nuc Acids Res 
Symp Ser 215-23, Horn T et al (1980) Nuc Acids Res Symp Ser 225-232). 

15 Hence, the ligands may be chemically synthesised or they may be prepared using 
recombinant techniques. 

In one aspect, preferably, the ligand is prepared by the use of chemical synthesis 
techniques. 

20 

RECOMBINANT METHODS 

In another aspect, preferably the ligands of the present invention may be 
produced from host cells using recombinant techniques. 

25 

A wide variety of host cells can be employed for expression of the nucleotide 
sequences encoding the ligands of the present invention. These cells may be both 
prokaryotic and eukaryotic host cells. Suitable host cells include bacteria such as 
E. coli, yeast, filamentous fungi, insect cells, mammalian cells, typically 
30 immortalized, e.g., mouse, CHO, human and monkey cell lines and derivatives 
thereof. Preferred host cells are able to process the expression products to 
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produce an appropriate mature polypeptide. Processing includes but is not limited 
to glycosylation, ubiquitination, disulfide bond formation and general post- 
translational modification. 

5 CHEMICAL MODIFICATION 

In one embodiment of the present invention, the ligand may be a chemically 
modified ligand. 

10 The chemical modification of a ligand and/or a key amino acid residue of the 
present invention may either enhance or reduce hydrogen bonding interaction, 
charge interaction, hydrophobic interaction, Van Der Waals interaction or dipole 
interaction between the ligand and the key amino acid residue(s) of the AR-LBD. 
By way of example, steric hinderance is a common means of changing the 

15 interaction of the AR-LBD binding domain with the activation domain. 

Preferably such modifications involve the addition of substituents onto a test 
compound such that the substituents are positioned to collide or to bind 
preferentially with one or more amino acid residues that correspond to the key 
20 amino acid residues of AR-LBD of the present invention. 

COMPARATIVE MODELS 

The unique features involved in AR selective ligand binding can be identified by 
25 comparing crystal structures of different steroid receptors, such as the AR and the 
progesterone (PR) receptors and/or isoforms of the same type of receptor. 

In a seventh aspect the present invention provides the use of a ligand identified by a 
method of screening which comprises the use of a crystal structure comprising an 
30 AR-LBD in the preparation of a medicament to prevent and/or treat androgen 
related disorders 0 , 
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DISORDERS 

The term androgen related disorders relates to disorder such as prostrate cancer 
5 (PC), androgen insensitivity syndrome (AIS), partial androgen insensitivity 
syndrome (PAIS), mild androgen insensitivity syndrome (MAIS) and complete 
androgen insensitivity syndrome (CAIS). 

PHARMACEUTICAL COMPOSITIONS 

10 

In a further aspect, the present invention provides a pharmaceutical composition, 
which comprises a ligand according to the present invention and optionally a 
pharmaceutically acceptable carrier, diluent or excipient (including combinations 
thereof). The pharmaceutical composition may comprise or may be used in 
15 conjunction with an additional pharmaceutically active compound or 
composition. 

The pharmaceutical compositions may be for human or animal usage in human 
and veterinary medicine and will typically comprise any one or more of a 

20 pharmaceutically acceptable diluent, carrier, or excipient. Acceptable carriers or 
diluents for therapeutic use are well known in the pharmaceutical art, and are 
described, for example, in Remington's Pharmaceutical Sciences, Mack 
Publishing Co. (A. R. Gennaro edit 1985). The choice of pharmaceutical 
carrier, excipient or diluent can be selected with regard to the intended route of 

25 administration and standard pharmaceutical practice. The pharmaceutical 
compositions may comprise as - or in addition to - the carrier, excipient or 
diluent any suitable binder(s), lubricant(s), suspending agent(s), coating agent(s), 
solubilising agent(s). 

30 Preservatives, stabilizers, dyes and even flavouring agents may be provided in the 
pharmaceutical composition. Examples of preservatives include sodium 
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benzoate, sorbic acid and esters of p-hydroxybenzoic acid. Antioxidants and 
suspending agents may be also used. 

There may be different composition/formulation requirements dependent on the 
5 different delivery systems. By way of example, the pharmaceutical composition 
of the present invention may be formulated to be delivered using a mini-pump or 
by a mucosal route, for example, as a nasal spray or aerosol for inhalation or 
ingestable solution, or parenterally in which the composition is formulated by an 
injectable form, for delivery, by, for example, an intravenous, intramuscular or 
10 subcutaneous route. Alternatively, the formulation may be designed to be 
delivered by both routes. 



Where the pharmaceutical composition is to be delivered mucosally through the 
gastrointestinal mucosa, it should be able to remain stable during transit though 
15 the gastrointestinal tract; for example, it should be resistant to proteolytic 
degradation, stable at acid pH and resistant to the detergent effects of bile. 

Where appropriate, the pharmaceutical compositions can be administered by 
inhalation, in the form of a suppository or pessary, topically in the form of a 

20 lotion, solution, cream, ointment or dusting powder, by use of a skin patch, orally 
in the form of tablets containing excipients such as starch or lactose or chalk, or 
in capsules or ovules either alone or in admixture with excipients, or in the form 
of elixirs, solutions or suspensions containing flavouring or colouring agents, or 
they can be injected parenterally, for example intravenously, intramuscularly or 

25 subcutaneously. For parenteral administration, the compositions may be best 
used in the form of a sterile aqueous solution which may contain other 
substances, for example enough salts or monosaccharides to make the solution 
isotonic with blood. For buccal or sublingual administration the compositions 
may be administered in the form of tablets or lozenges which can be formulated 

30 in a conventional manner. 
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ADMINISTRATION 

The invention further provides a method of preventing and/or treating an 
androgen related disorder in a mammal, the method comprising administering to 
5 a mammal a ligand which binds to at least the AR-LBD with high affinity, and in 
some cases to such an extent so as to modulate said AR-LBD. In one aspect, the 
block binding of further ligands to at least the AR-LBD. Such ligands may be 
useful in, for example, the treatment of AR mediated disorders in males or 
females. 

10 

Typically, a physician will determine the actual dosage which will be most 
suitable for an individual subject and it will vary with the age, weight and 
response of the particular patient and severity of the condition. The dosages 
below are exemplary of the average case. There can, of course, be individual 
15 instances where higher or lower dosage ranges are merited. 

The compositions (or component parts thereof) of the present invention may be 
administered orally. In addition or in the alternative the compositions (or 
component parts thereof) of the present invention may be administered by direct 

20 injection. In addition or in the alternative the compositions (or component parts 
thereof) of the present invention may be administered topically. In addition or in 
the alternative the compositions (or component parts thereof) of the present 
invention may be administered by inhalation. In addition or in the alternative the 
compositions (or component parts thereof) of the present invention may also be 

25 administered by one or more of: parenteral, mucosal, intramuscular, intravenous, 
subcutaneous, intraocular or transdermal administration means, and are 
formulated for such administration. 

By way of further example, the pharmaceutical composition of the present 
30 invention may be administered in accordance with a regimen of 1 to 10 times per 
day, such as once or twice per day. The specific dose level and frequency of 
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dosage for any particular patient may be varied and will depend upon a variety of 
factors including the activity of the specific compound employed, the metabolic 
stability and length of action of that compound, the age, body weight, general 
health, sex, diet, mode and time of administration, rate of excretion, drug 
5 combination, the severity of the particular condition, and the host undergoing 
therapy. 

The term "administered" also includes but is not limited to delivery by a mucosal 
route, for example, as a nasal spray or aerosol for inhalation or as an ingestable 
10 solution; a parenteral route where delivery is by an injectable form, such as, for 
example, an intravenous, intramuscular or subcutaneous route. 

Hence, the pharmaceutical composition of the present invention may be 
administered by one or more of the following routes: oral administration, 
15 injection (such as direct injection), topical, inhalation, parenteral administration, 
mucosal administration, intramuscular administration, intravenous 
administration, subcutaneous administration, intraocular administration or 
transdermal administration. 

20 STRUCTURAL STUDIES 

One aspect of the invention provides to a method of determining the secondary 
and/or tertiary structures of polypeptides with unknown structures comprising the 
step of using a crystal structure or model of the invention. 

25 

The polypeptide under investigation is preferably structurally or functionally 
related to the androgen receptor ligand binding domain. For example, the 
polypeptide may show a degree of homology over some or all parts of the 
primary amino acid sequence. 

30 
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As applied to polypeptides, the term "substantial sequence identity" means that 
two peptide sequences, when optimally aligned, such as by the programs GAP or 
BESTFIT using default gap, share at least 40%, 50%, 60%, 65%, 70%, 75%, 
80%, or 85% sequence identity, preferably at least 90 percent sequence identity, 
5 more preferably at least 95 percent sequence identity or more. Preferably, residue 
positions which are not identical differ by conservative amino acid substitutions. 
For example, the substitution of amino acids having similar chemical properties 
such as charge or polarity are not likely to effect the properties of a protein. 
Examples include glutamine for asparagine or glutamic acid for aspartic acid. 

10 

In a further embodiment, the invention relates to a method of determining three 
dimensional structures of polypeptides with unknown structures, preferably a 
native or mutated AR-LBD by applying the structural coordinates of an AR-LBD 
structure of the invention to nuclear magnetic resonance (NMR) data of the 

15 unknown structure. This method comprises the steps of: (a) determining the 
secondary structure of an unknown structure using NMR data; and (b) 
simplifying the assignment of through-space interactions of amino acids. The 
term " through-space interactions" defines the orientation of the secondary 
structural elements in the three dimensional structure and the distances between 

20 amino acids from different portions of the amino acid sequence. The term 
"assignment" defines a method of analyzing NMR data and identifying which 
amino acids give rise to signals in the NMR spectrum. 

The polypeptide may, for example be a mutant form of an AR-LBD. The term 
25 "mutant" refers to a polypeptide that is obtained by replacing at least one amino 
acid residue in a native AR-LBD with a different amino acid residue. Mutation 
can also be accomplished by adding and/or deleting amino acid residues within 
the native AR-LBD or part thereof A mutant may or may not be functional. 

30 Alternatively, the polypeptide may be an AR-LBD from a different species. 
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Alternatively, the polypeptide may perform an analogous function or be 
suspected to show a similar binding mechanism to the AR-LBD. 



ANDROGEN RECEPTOR LK3AND BINDING DOMAIN STRUCTURES 

5 

The present invention provides a secondary or three-dimensional structure of an 
AR-LBD or part thereof. In an embodiment the structure is a crystalline form An 
AR-LBD structure may comprise an AR-LBD unit cell. 

10 An AR-LBD structure includes the secondary or three-dimensional structure of a 
native AR-LBD, a derivative AR-LBD, or a mutant AR-LBD. Thus, a crystalline 
form includes native crystals, derivative crystals, and co-crystals. The crystals 
generally comprise a substantially pure AR-LBD in crystalline form. It is 
understood that the AR-LBD structures of the invention are not limited to a 

15 naturally occurring or native AR-LBD but include polypeptides with substantial 
sequence identity to an AR-LBD. An AR-LBD structure also includes mutants of 
a native AR-LBD obtained by replacing at least one amino acid residue in a 
native AR-LBD with a different amino acid residue, or by adding or deleting 
amino acid residues within the native polypeptide, and having substantially the 
20 same secondary or three-dimensional structure as the native AR-LBD from which 
the mutant is derived i.e. having a set of atomic structural coordinates that have a 
root mean square deviation of less than or equal to about 5, 4, 3, 2, or 1 .5 A when 
superimposed with the atomic structure coordinates of the native AR-LBD from 
which the mutant is derived when at least 50% to 100% of the atoms of the native 
25 AR-LBD are included in the superimposition. It should be noted that the AR- 
LBD structures contemplated herein need not exhibit AR-LBD activity. 

A derivative AR-LBD structure of the invention comprises an AR-LBD structure 
in association with one or more moieties that are heavy metal atoms. For 
30 example, derivative crystals of the invention generally comprise a crystalline AR- 
LBD in covalent association with one or more heavy metal atoms. The AR-LBD 
may correspond to a native or mutated AR-LBD. Heavy metal atoms useful for 
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providing derivative AR-LBD structures include by way of example, and not 
limitation, gold, mercury, etc. 

The invention features an AR-LBD structure in association with one or more 
5 moieties that are ligands. The association may be covalent or non-covalent. 
Crystalline forms of this type are referred to herein as co-crystals. The compound 
may be any organic molecule, and it may modulate the function of an AR-LBD 
by for example inhibiting or enhancing its function, or it may be a substrate for 
the AR-LBD. It is preferred that the geometry of the compound and the 
10 interactions formed between the compound and the AR-LBD provide high 
affinity binding between the two molecules. 



The secondary or three-dimensional structures of the particular AR-LBD 
described herein provide useful models for the secondary or three-dimensional 
15 structures of AR-LBD from any species, particularly mammalian, including 
bovine, ovine, porcine, murine, equine, preferably human, from any source 
whether natural, synthetic, semi-synthetic, or recombinant. 

In a particular embodiment of the invention, a secondary or three-dimensional 
20 crystal structure of an AR-LBD that associates with an inhibitor of an AR-LBD is 
provided comprising at least two or three atomic contacts of atomic interactions 
in Figure 4, each atomic interaction defined therein by an atomic contact (more 
preferably, a specific atom where indicated) on the inhibitor, and an atomic 
contact (more preferably, a specific amino acid residue where indicated) on the 
25 AR-LBD (i.e. ligand atomic contact). The binding domain may be defined by the 
ligand atomic contacts of atomic interactions in Figure 4. Preferably, the binding 
domain is defined by the atoms of the ligand atomic contacts having the structural 
coordinates for the atoms listed in Table 4. 
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IDENTIFICATION OF HOMOLOGUES 

The knowledge of an AR-LBD structure of the invention enables one skilled in 
the art to identify homologues of AR-LBD. This is achieved by searches of three- 
5 dimensional databases. Since structural folds are conserved to a greater extent 
than sequence, one may identify homologues with very little sequence identity or 
similarity. Programs that provide this type of database searching are known in the 
art and include DalL The structural coordinates of a protein structure are 
submitted and the program performs a multiple structural alignment with proteins 
10 in the protein data bank. Homologues identified in accordance with the present 
invention may be used in the methods of the invention described herein. 

PROGESTERONE (PR) RECEPTOR 

15 The present invention also provides experimentally isolated crystals for the PR- 
LBD in complex with the ligand metribolone (R1881). From these 
experimentally isolated crystals, a three dimensional (3-D) structure for the PR 
receptor has been produced to medium resolution. The PR-LBD comprises a 
LBD which is substantially the same as the LBD of the AR-LBD except that the 

20 LBD comprises a stronger bending of the helices H10 and HI 1 and helix H9 has 
a length which is at least one helical turn shorter than the AR-LBD. The 
sequence for the wild type PR-LBD site comprises at least SEQ ID No 3 (see 
Figure 1). The PR-LBD-R1881 crystal complex belongs to the space group P2i 
and having the unit dimensions a = 58.40A, b = 65.0 A, c = 71.18A and an angle 

25 P of 95.7° and with the unit cell dimensions as presented in Table 1 . 

The present invention also demonstrates the surprising finding that the two 
independent molecules in the crystal structure of hPR LBD-R1881 exhibit 
different modes of ligand binding. One orientation pf R1881 in one monomer 
30 resembles that of R188 1 in the hAR LBD complex while in the second monomer 
R1881 is orientated similar to progesterone in the hPR LBD-progesterone 
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complex. Thus it may be possible to design ligands that selectively bind to either 
one or both of the monomers in the hPR-LBD-ligand complex, thereby 
dissociating desirable preventative and/or therapeutic effects from undesirable 
side effects of PR ligands. 

5 

A partial homology model of the AR receptor has been created based on the 
experimentally derived hPR-LBD-progesterone crystal complex. This homology 
model captures the essential difference in binding between the AR-LBD crystal 
and AR-LBD model structures. This homology model also highlights the 
10 differences with respect to the secondary structure alignment between the model 
structure of the present invention and that from other published models. 

By way of example, the model structure of the present invention differs from 
other published models [Yong, 1998] with respect to the secondary structure 
15 alignment. Yong [1998] based their model on the crystal structure of the RARa 
LBD [Bourguet, 1995]. The secondary structure assignment by Yong et al as 
compared to the hAR LBD crystal structure is similar between helices H3 and 
H10, but the assignment differs most for helices Hll, H12 and the additional 
helix at the C-terminal end. 

20 

The ligand binding pocket interactions of the present invention have been 
determined using the hAR LBD-R1881 crystal structure and the hPR LBD- 
R1881 complex. 

25 Based on a comparison of the LBP interactions, the differences in ligand binding 
specificities between the AR and PR can be determined. Using these differences, 
the ability of a ligand to bind to either or both of the AR and PR may be 
predicted. Hence, if it is known that one tissue possesses solely one form of an 
AR and/or PR receptor, then it may be possible to confer a degree of tissue 

30 specificity to a ligand by designing a ligand to bind the predominant form of the 
AR and/or PR present in that tissue. 
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Thus, the present invention also provides an understanding of the differences 
between R1881 and progesterone binding to AR and PR receptors and therefore a 
means to design AR and PR ligands with the desired degree of efficacy. 

5 

The present invention also provides a crystal model comprising the hPR-LBD 
which is built from all or part of the crystal co-ordinate data as shown in Table 5 
(see Figure 7). 

10 The present invention also covers these novel aspects and their uses. In this 
respect, the teachings of the AR-LBD (i.e. hAR-LBD) are equally applicable to 
the novel aspects of the PR-LBD (i.e. hPR-LBD). 

Thus, for example, aspects of the present invention concerning PR-LBD relate to; 

15 

A crystal structure comprising a PR-LBD. 
A crystal structure for PR-LBD. 

20 A crystal PR-LBD having the structural co-ordinates as set forth in Table 

5. 

Acrystal structure comprising a PR-LBD-ligand complex. 
25 A crystal structure comprising a PR-LBP. 

A model of at least part of an PR-LBD made using or comprising or 
depicting a crystal structure according to any one of the foregoing aspects 
of the invention. The crystal structure and the model may be provided in 
30 the form of a computer readable medium. 
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A method of screening for a ligand capable of binding an androgen 
receptor binding domain, comprising the use of a crystal structure or a 
model of PR-LBD. For example, the method may comprise the step of 
contacting the PR-LBD with a test compound, and determining if said test 
5 compound binds to said ligand binding domain. The method may be an in 

vitro method and/or an in silico method and/or an in vivo method. 

A ligand identified by a screening method of a foregoing aspect of the 
invention. Preferably the ligand is capable of modulating the activity of a 
10 PR-LBD. As mentioned above, ligands which are capable of modulating 

the activity of PR-LBDs have considerable therapeutic and prophylactic 
potential. 

The use of a ligand according to the foregoing aspect of the invention, in 
15 the manufacture of a medicament to treat and/or prevent a disease in a 

mammalian patient. There is also provided a pharmaceutical composition 
comprising such a ligand and a method of treating and/or preventing a 
disease comprising administering the step of administering such a ligand 
according or pharmaceutical composition to a mammalian patient. 

20 

The crystal structures and models described above also provide 
information about the secondary and tertiary structure of PR-LBDs. This 
can be used to gleen structural information about other, previously 
uncharacterised polypeptides. Thus, according to one aspect of the 

25 invention there is provided a method of determining the secondary and/or 

tertiary structures of polypeptides with unknown (or only partially known) 
structure comprising the step of using such a crystal or model. The 
polypeptide under investigation is preferably structurally or functionally 
related to the progesterone receptor ligand binding domain. For example, 

30 the polypeptide may show a degree of homology over some or all parts of 

the primary amino acid sequence. Alternatively, the polypeptide may 
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perform an analogous function or be suspected to show a similar binding 
mechanism to the PR-LBD. 

EXAMPLES 

5 

The invention will now be further described only by way of example in which 
reference is made to the following Figures: 

Figure 1 which shows a sequence listing for hAR-LBD (SEQ ID No 1) and HPR 
10 LBD (SEQ ID No 3) amino acid sequences and a secondary structure for hAR- 
LBD (SEQ ID No 2). SEQ ID No 1 is presented in the second line of Figure 1, 
SEQ ID No 3 is presented as the first line in Figure 1 . SEQ ID No 2 is presented 
as the third line in Figure 1 . 

15 Figure 2 which shows chemical formulae; 

Figure 3 which shows three dimensional structures of hAR LBD and hPR LBD 
complexed with metribolone (R1881); 

20 Figure 4 which shows a stereo diagrams showing interactions between a bound 
Iigand and protein chain in hAR-LBD and hPR-LBD ligand complexes; 

Figure 5 which shows a stereo diagram showing the location of hAR-LBP 
pathogenic mutations; 

25 

Figure 6 which presents Table 4, which has the structural co-ordinates for the 
hAR-LBD; and 

Figure 7 which presents Table 5, which has the structural co-ordinates for the 
30 hPR-LBD. 
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In more detail: 

Figure 1 shows a comparison between hAR LBD and hPR LBD amino acid 
sequences. The numbering scheme of AR is according to [Lubahn, 1988]. The 
5 sequence alignment was performed with CLUSTALW [Thompson, 1994]. The 
residue number applies to the residue directly above or below the last digit. 
Identical residues are outlined in solid black boxes; gray shading denotes the 
residues not located in the electron density and thus not included in the model. 
Selected secondary structure elements are from PROCHECK [Laskowski, 1993] 

10 according to Kabsch & Sander [Kabsch, 1983]: E, strand in (3-sheet; H, a-helix; 
Amino acids interacting with bound ligands (R1881 or progesterone) are 
coloured red (van der Waals cutoff distance 4.0 A). The mutations presently 
known for AIS in the hAR LBD are marked below the appropriate position of the 
respective amino acid in the hAR LBD. Abbreviations: x = prostate cancer, p = 

15 PAIS/MAIS, c = CAIS, a - PAIS/MAIS and CAIS, b = PAIS/MAIS and prostate 
cancer, v = CAIS and prostate cancer, w = PAIS/MAIS and CAIS and prostate 
cancer. 

Figure 2 shows the numbering scheme of R1881 (left) and progesterone (right). 

20 

Figure 3 shows the diagrams of the three-dimensional structures of hAR-LBD 
and hPR-LBD complexed with R1881. (A) MOLSCRIPT/Raster3D [Kraulis, 
1991; Merritt, 1994] ribbon diagram of hAR LBD. (B) MOLSCRIPT Stereoview 
of the C a -trace of the superimposed hAR LBD R1881 (black) and hPR LBD 
25 R1881 (red) structures showing the hAR-LBD residue numbering. The (B) view 
is related to (A) by a clockwise 90° rotation about the vertical axis. 

Figure 4 shows the stereo diagrams showing the interactions between the bound 
ligand and the protein chain in hAR LBD - R1881 (A), hPR LBD - R1881 
30 (molecule B) (B) and hPR LBD - progesterone (C). Residues included are either 
hydrogen-bonded or have Van der Waals contacts (cutoff distance 4.0 A) with 
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any of the ligands. Residues V685, Y763 in hAR LBD and corresponding 
residues 1699, Y777 in hPR LBD are hydrogen-bonded to other residues or water 
molecules near the ligand binding site and are also included. Bound ligand is 
coloured black, conserved residues are coloured gray, different residues in hAR 
5 LBD and hPR LBD are coloured red. Residue labels with an asterisk (*) denote 
residues that do not have Van der Waals contacts within the specified cutoff 
distance with the ligand. Hydrogen bond distances for the hPR LBD - 
progesterone complex were calculated from the PDB deposited coordinates of 
molecule A. Figures produced with MOL SCRIPT [Kraulis, 1991]. 

10 

Figure 5 shows the stereo diagram showing the location of the hAR LBP 
pathogenic mutations: the coloured spheres are represented at the residue's C a 
position: mutations observed in prostate cancer (PC) are represented in red, those 
observed for CAIS are shown in yellow and those observed for PAIS/MAIS are 
15 drawn in cyan. Mutation of one residue (Met 749) is implicated in both prostate 
cancer and CAIS and is represented in orange. Figure produced with 
MOLSCRIPT [Kraulis, 1991] and Raster3D [Merritt, 1994]. The view is rotated 
by about 80° clockwise about a vertical axis with respect to the orientation shown 
in Figure 3 A. 

20 

Example 1 

Plasmid constructs 

25 The cDNAs coding for the human androgen and progesterone receptors were 
obtained from the groups of A. Cato (Forschungszentrum Karlsruhe, Germany) 
and P. Chambon (IGBMC, Strasbourg, France) respectively. The ligand binding 
domains (LBD) of the androgen receptor (amino acid residues (aa) 663 - 919) 
and the progesterone receptor (aa 678 - 933) were amplified by the PGR 

30 technology using appropriate primers and cloned into a pGEX-KG vector [Hakes, 
1991]. The resulting fusion proteins consisted of a glutathion-S-transferase, 
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containing a C-terminal thrombin cleavage site, optimised by a glycine-rich 
"linker" region followed by the corresponding LBD. The constructs were then 
transformed into the E. coli strain BL21 (DE3). 

5 Protein expression and purification 

Fermentation using the corresponding recombinant E. Coli strains expressing hAR 
LBD was carried out in 2XYT medium in the presence of ampicillin (200 ug/ml) 
supplemented with lOuM R1881. Expression was induced with 30 juM IPTG 

10 (isopropyl-p-D-thiogalactoside) and the fermentation (10 L) was continued at 
15 °C for 14 — 16 hours. Cells were harvested by centrifugation and disrupted 
twice in a continuous high pressure homogeniser (9000PSI) in a buffer containing 
50 mM Tris/HCl, pH 8, 150 mM NaCl, 5 mM EDTA, 10 % Glycerol, 100 uM 
R1881, 100 uM PMSF and 10 mM DTT. All buffers were purged with nitrogen 

15 before adding DTT. The supernatants from ultracentrifugation were loaded onto a 
glutathione sepharose column, washed with 50 mM TrisJHCl, pH 8, 150 mM 
NaCl, 5 mM EDTA, 10 % Glycerol, 10 uM R1881, 0.1% n-octyl-p-glucoside and 
1 mM DTT and the fusion protein was eluted using the same buffer supplemented 
with 15 mM reduced glutathione. The eluate was diluted with 100 mM HEPES 

20 pH 7.2, 150 mM NaCl, 0.5 mM EDTA, 10% glycerol, 10 uM R1881, 1 mM DTT 
and 0.1% n-octyl-p-glucoside up to a fused protein concentration of 1 mg/ml. A 
thrombin cleavage (2 N.I.H. units/mg fusion protein) was performed overnight at 
4°C. The protein mixture was further diluted three fold with 10 mM HEPES pH 
7.2, 10% glycerol, 10 nM R1881, 10 mM DTT and 0.1% n-octyl-p-glucoside and 

25 loaded onto a Fractogel S0 3 " column and eluted with a gradient of 50-500 mM 
NaCl in a 10 mM HEPES buffer pH 7.2, 10% glycerol supplemented with 10 nM 
R1881, 10 mM DTT and 0.1% n-octyl-p-glucoside. Approximately 2.4 mg of 
purified hAR LBD can be recovered from 1L E.Coli cell cultures. Protein 
concentration was determined with Bio-rad Protein Assay. Fermentation and 
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purification of the hPR LBD was performed identically but a HEPES buffer pH 
7.3 was used from the beginning. 

Results 1 

5 

Protein expression and purification 

Glutathion-S -transferase fusion proteins can be expressed to very high levels in 
the E. coli strain BL 21 (DE3) [Hakes, 1991]. We and others have used this 

10 system successfully for the production of the ligand binding domains of the 
human progesterone [Williams, 1998] and androgen receptors. An optimal and 
stable expression of soluble fusion proteins strongly depends on the presence of 
ligand in the cells during fermentation (data not shown). During cell disruption, 
purification and concentration any protein oxidation was avoided. Therefore all 

15 buffers were purged carefully with nitrogen and DTT was used as an antioxidant. 
Fusion proteins were purified by the use of Glutathion sepharose and 
subsequently cleaved with thrombin. Ligand binding domains were separated 
from the cleavage products and thrombin by cation exchange chromatography. 
Concentration was performed with the aid of a nitrogen pressure diafiltration 

20 system and the concentrate was immediately used for crystallisation experiments. 

Example 2 

Crystallisation and data collection 

25 

Both proteins were dialysed after purification with buffer containing 50mM 
HEPES pH 7.2 for hAR LBD, or lOmM HEPES pH 7.2 for hPR LBD, 
respectively, 10% glycerol, lOmM DTT, 0.1% n-octyl-p-glucoside, lOmM 
R1881 and 150mM Li2SC>4 and were concentrated up to 3 mg/ml for the hPR 
30 LBD - Rl 88 1 and up to 4.4 mg/ml for the hAR LBD - Rl 88 1 respectively. Both 
proteins were crystallised using the vapour diffusion method at 20°C for the hAR 
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LBD complex and at 4°C for hPR LBD complex respectively. Due to the 
instability and continuous precipitation of both proteins, crystallisation 
experiments had to be set up immediately after concentration. For the hAR LBD - 
R1881 complex, the reservoir solution contained 0.4M Na2HP0 4 -2(H 2 0), 0.4M 
5 K 2 HP0 4 , 0.1M TRIS-HC1 pH 8.5, 0.1M (NH 4 ) 2 HP0 4 and 5% PEG200. Drops 
were composed of equal volumes of protein and reservoir solution and were set 
up using the sitting drop method. Within two days crystals appeared and grew to 
typical dimensions of 50x50x80 jum 3 surrounded of precipitate. Crystals were 
flash frozen using a cryoprotecting solution of 60% PEG 400 in 0.1M TRIS-HC1 

10 pH 8.5. Data was collected from one crystal at the ESRF (Grenoble, France) at 
beamline ID14-EH4 to a resolution of 2.4 A. For the hPR LBD - R1881 complex, 
the reservoir solution contained 10% iso-propanol and lOOmM sodium citrate in 
50mM HEPES pH 7.5. The drops were set up using the hanging drop method and 
were composed of a 2:1 ratio of protein and reservoir solution. First crystals 

15 appeared after five weeks and grew to a size of approximately 160x120x40 \xm 3 . 
One crystal was flash frozen using a cryo-protecting solution containing 30% 
glycerol. Data were collected at beamline BM14 at the ESRF (Grenoble, France) 
to a resolution of 2.8 A. Before data collection was complete the crystal 
decomposed in the X-ray beam. 

20 

Both data sets were integrated and reduced using DENZO and SCALEPACK 
[Otwinowski, 1997]. Statistics of X-ray data collection and processing are 
summarised in Table 1 
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Table 1. Summary of data collection, processing and scaling 



hARLBD-R1881 



hPRLBD-R1881 




P2i2i2i 

a=54.28 b=66.14 c=71.72 A 

0.9324 

24.4-2.40 

37,443 

10,638 

99.8 (99.9) 

3.5 

0.078 (0.351) 

12.0 

49.4 



P2i 

a=58.40 b=65.01 c=71.18 A 0=95.7° 

0.9537 

12.47-2.80 

67,655 

8,875 

67.0 (68.8) 
7.6 

0.048 (0.151) 

15.2 

48.2 



• Values in parentheses refer to the last resolution shell, 2.46 > d > 2.40 A for 
hAR LBD - R1881 complex and 2.87 > d > 2.80 A for hPR LBD - R1881 



Structure determination 

Contrary to the hPR LBD - progesterone complex which crystallises with one 
10 homodimer in the monoclinic space group P2i the hAR LBD crystallises with 
one monomer in the orthorhombic spacegroup P2i2i2i. Therefore the structure 
determination for the hAR LBD - R1881 complex was carried out using the 
molecular replacement method in AMoRe [Navaza, 1994] with the coordinates of 
only the monomer A of the hPR LBD dimer (PDB entry: 1A28, [Williams, 1998 
15 ]) without the progesterone ligand. The hPR LBD - Rl 881 complex crystallises in 
the same monoclinic space group P2i and with similar cell constants as the hPR 
LBD - progesterone complex and thus the whole dimer without the ligand was 
used as a search model in AMoRe. Clear solutions were obtained for both 
structures using data between 15.0 and 3.5 A for the hAR LBD and 12.0 and 3.5 
20 A for the hPR LBD, respectively. 



5 



complex. 
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Refinement ofhAR LBD - R1881 complex 

The molecular replacement solution obtained was refined using X-PLOR 
[Briinger, 1992]. In all refinements and map calculations with X-PLOR a bulk 
5 solvent correction was used and all low resolution data was included. Prior to the 
refinement calculations, a random 5% sample of the reflection data was flagged 
for R-free calculations [Brunger, 1992]. All model interactive visualisation and 
editing was carried out using TURBO [Roussel, 1990]. Refinement started using 
data up to 3.5 A and resolution was gradually extended to 2.4 A. The model was 

10 edited according to the known hAR LBD sequence [Lubahn, 1988] using 2|F 0 |- 
|F C | and |F 0 |-|F C | maps calculated at 3.2 A resolution and simulated annealed omit 
maps. The fast wARP [Lamzin, 1997 ; Perrakis, 1997] molecular replacement 
protocol was also applied after each XPLOR refinement to further improve the 
2|F 0 HF C | electron density map. Prior to its inclusion in the model, the electron 

15 density for the R1881 ligand was clearly visible in all maps. A model for the 
ligand was obtained from the Cambridge Structural Database entry HMESTR 
[Precigoux, 1981; Allen, 1979 ]. The XPLOR topology and parameter 
dictionaries were built using program XPL02D [Kleywegt, 1995]. In the final 
refinement at 2.4 A, 26 water molecules were included in the model, and 

20 individual restrained B-factors were refined for all non-hydrogen atoms. The final 
values of R and R-free were 21.0 % and 29.7 %, respectively. The R-free/R ratio 
is only slightly smaller than expected [Tickle, 1998] for the number of atoms and 
reflections used in the refinement. The refinement results and statistics are shown 
in Table 2. 

25 
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Table 2. Final refinement statistics for hAR LBD and hPR LBD complexed 
with R1881 



* calculated with SIGMAA [Centre, 1999 ; Read, 1986]. 



R1881 in complex with 


hAR LBD 


hPRLBD 


Final R-factor (%) 


21.0 


21.7 


Final R-free (%) 


29.7 


34.3 


Number of non-hydrogen protein atoms 


2044 


4027 


non-hydrogen protein atoms missing 


22 


32 


non-hydrogen ligand atoms 


21 


42 


solvent molecules 


26 


1 


Estimated overall r.m.s. coordinate error (A) * 


0.47 


0.53 


Model r.m.s. deviations from ideality: 






Bond distances (A) / Bond angles (°) 


0.01 / 1.7 


0.02/4.4 


Average B values (A 2 ): 






Main-chain / Side-chain 


48.3/52.1 


33.2/28.7 


Ligand / Solvent 


45.2/49.2 


10.2/3.6 



5 



Refinement ofhPR LBD - R1881 complex 

The molecular replacement solution obtained was refined using REFMAC 
[Murshudov, 1997] using the maximum-likelihood approach. Bulk solvent 

10 scaling of Fo and Fc was applied based on Tronrud's solvent correction and all 
available data with no sigma cut-offs were used. All map calculations were done 
including calculated F-values for missing reflections. To avoid model bias, 
calculated maps using only Fo were checked. After the first refinement step the 
sigmaA-weighted calculated 2|F 0 |-|F C ) and |F 0 |-|F C | maps were inspected using the 

15 program O [Jones, 1991 ] and electron density of the ligand was clearly observed. 
The ligand was build up in SYBYL6.5 (Tripos Inc., 1998) and was included in 
further refinement steps. A dictionary file for distance restraints for the R1881 
molecule was prepared using MAKEDICT [Collaborative Computational Project 
Number 4, 1994]. The model was furthermore refined with alternating cycles of 

20 interactive model building and iterative refinement steps. Towards the end of the 
refinement, only one water molecule in the LBP was added. Although some more 
possible water sites were located in the electron density we decided not to include 
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them in the model due to the low resolution and missing data. The final model 
comprises 4027 protein atoms, 42 ligand atoms and 1 water molecule with final R 
values of R = 21.7 % and R-free = 34.3 % 5 respectively. A summary of the 
refinement and model statistics is included in Table 2. 

5 

Results 2 



Structure analysis and comparison of the hAR LBD - R1881 and the hPR LBD - 
R1881 complexes 

10 

Both crystal structures were analysed with PROCHECK [Laskowski, 1993] and 
their stereochemical quality parameters were within their respective confidence 
intervals. In the Ramachandran cp,(|) plot for the non-proline and non-glycine 
residues (not shown) 87.7% for the hAR LBD - R1881 and 85% for the hPR 

15 LBD - Rl 881 structures respectively lie within the most favoured regions. For the 
hAR LBD - R1881 complex no residue is outside the normally allowed regions 
whereas in the hPR LBD - R1881 complex two residues are located in disallowed 
regions (Asn 705 and Ser 793 in molecule A) and three residues (Thr 796 in 
molecule A, Asn 705 and Ser 793 in molecule B) are located in generously 

20 allowed regions. These residues are not involved in ligand binding and are 
located in loop regions which are most probably not involved in ligand 
recognition. In the hAR LBD - R1881 structure there is only one close contact 
(2.6 A) between Met895 and Ala896 carbonyl oxygens. In the hPR LBD - R1881 
structure some close contacts were observed but due to the resolution and 

25 completeness of the data this is not surprising. The overall fold of the hAR and 
hPR LBD - R1881 structures is very similar, and also with that of hPR LBD 
complexed with progesterone [Williams, 1998]. On the basis of the secondary 
structure calculated with PROCHECK [Laskowski, 1993] according to Kabsch & 
Sander [Kabsch, 1983], the hAR LBD - R1881 structure contains 9 a-helices, 

30 two 3 io helices and four short p-strands associated in two anti-parallel p-sheets. 
The helices are arranged in the typical 'helical sandwich' pattern as in hPR LBD - 
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progesterone complex [Williams, 1998] and helices H4, H5 and H10, Hll are 
contiguous. There are a few minor variations in secondary structure between hAR 
LBD - R1881 and hPR LBD - progesterone but probably the most interesting is 
that in hAR LBD - R1881 helix H12 seems to be split into two shorter helical 
5 segments, with nine and five residues each respectively. This observation was not 
seen in the hPR LBD - R1881 structure, although a bending of helix H12 is also 
seen here. Figure 1 shows a comparison between the amino acid sequences of 
hAR LBD and hPR LBD. A ribbon diagram of the hAR LBD - Rl 881 structure is 
shown in Figure 3 along with a superimposed C a -trace of the hAR LBD - R1881 

10 and hPR LBD - R1881 molecules. The crystal structure coordinates of hAR LBD 
- R1881 were superimposed with those of hPR LBD - R1881 (molecule B) and 
hPR LBD - progesterone (molecule A) using LSQKAB [Kabsch, 1976]. For the 
superposition the main chain atoms except three N-terminal (Cys 669-Pro 671) 
and one C-terminal (Thr 918) residues were used. The r.m.s. coordinate 

15 deviations were 1.16 and 1 .22 A respectively, again an indication of the similarity 
of the overall fold of these three molecules. In hAR LBD - R1881, Cys 669 and 
Cys 844 are very close and a disulphide bridge between them was modelled, 
based on the electron density. However there is no supporting biochemical 
evidence so far and it should be noted that the temperature factors of both 

20 cysteine residues and the adjacent residues are very high. A cis peptide bond is 
found at position Pro 849 in hAR LBD - R1881. 

Example 3 

25 Comparative modeling 

A model of the hAR LBD was built based on the coordinates of the hPR LBD - 
progesterone complex (molecule A) [Williams, 1998]. Amino acid substitutions 
were made based on the sequence alignment in Figure 1 using the Insight 98.0 
30 software (MSI Inc., San Diego, CA USA 1998). Water molecules as observed in 
the hPR LBD crystal structure (molecule A) were included in the calculations. 
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Soaking of the initial model and the energy minimisation protocols applied are 
described in detail elsewhere [Letz, 1999]. 

Results 3 

5 

Comparison of model and crystal hAR LBD structure 

The model and the crystal structure of the hAR LBD are very similar with respect 
to their overall structure, the ligand binding pocket (LBP) and the ligand 

10 orientation. The root-mean-square (r.m.s.) deviation between 149 equivalent C a 
atoms in helices between the model and crystal structure of the hAR LBD is 1.09 
A. It is comparable to the r.m.s. deviation of 0.84 A and 0.85 A between the 
crystal structures of the hAR LBD and the hPR LBD - progesterone complex, on 
which the model of the hAR LBD was based on and the hAR LBD model and the 

15 hPR LBD - progesterone crytsal structure, respectively. The most striking 
difference between the model and the crystal structure was found in the region of 
helix H6. In the hAR LBD crystal structure, this region was identified as an oc- 
helix (calculated with the Kabsch & Sander algorithm [Kabsch, 1983] as 
implemented in Insight98.0 (MSI Inc., San Diego USA, 1998), whereas in the 

20 hPR LBD - progesterone complex (molecule A) no a-helix is observed. There is 
also no a-helix in the hAR LBD model in this area. The ligand orientation in both 
the hAR LBD - R1881 model and crystal structure is very similar. The same 
hydrogen bonds are found between the OS of R1881 and Arg 752 with a distance 
of 3.0 A in the crystal and 3.4 A in the model structure, respectively. In the ligand 

25 D-ring, 017 is within hydrogen bond distance to Asn 705 and Thr 877, 3.1 and 
3.0 A in the crystal structure, 2.6 and 3.3 A in the model structure, respectively. 

Discussion 

30 Comparative modelling 
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The model of the hAR LBD which is based on the hPR LBD - progesterone 
complex is very similar to the hAR LBD crystal structure with respect to the 
overall fold and ligand orientation. The most striking differences were a stronger 
bending of helices H10 and HI 1 in the model compared to the crystal structure of 
5 the hAR LBD. We modelled helix H9 with the same length as in the crystal 
structure of the hPR LBD - progesterone complex. In the hAR LBD crystal 
structure it is one helical turn shorter. This region is far away from the LBP and 
therefore has no influence on the size of the LBP. Our model structure differs 
from other published models [Yong, 1998] with respect to the secondary 
10 structure alignment, as the authors based their model on the crystal structure of 
the RARa LBD [Bourguet, 1995]. The secondary structure assignment by Yong 
et ah as compared to the hAR LBD crystal structure is similar between helices 
H3 and H10, the assignment differs most for helices Hll, H12 and the additional 
helix at the C-terminal end. 

15 

Ligand binding pocket(LBP) interactions 

There are a total of 18 amino acid residues in hAR LBD and hPR LBD that 
interact with the bound ligand (either R1881 or progesterone). These residues are 

20 highlighted in Figure 1 and included in Figure 4. Most of these residues are 
hydrophobic and interact mainly with the steroid scaffold, while a few are polar 
and may form hydrogen bonds to the polar atoms in the ligand. The hydrogen- 
bonding scheme to 03 of R1881 and progesterone is similar but not identical, as 
shown in Figure 4. In the hAR LBD - R1881 crystal structure, this oxygen atom 

25 forms a hydrogen bond to Arg 752 (Arg 766 in hPR LBD), but in contrast with 
the hPR LBD - progesterone complex the distance of 3.9 A to Gin 71 1 (Gin 725 
in hPR LBD) does not allow a hydrogen bond. There is a water molecule near 03 
that is hydrogen-bonded to three other residues with a nearly triangular geometry 
(R752 JSP 1 , M745 O and Q711 O el in hAR LBD; R766 N nl , M759 O and Q725 

30 O el in hPR LBD - progesterone). Two of these residues are acceptors, therefore a 
third acceptor atom (OS in either progesterone or R1881) in a direction 
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perpendicular to the plane of the triangle is unlikely, also due to unfavourable 
geometry. The water molecule hydrogen-bonded to Q711 N e2 in hAR LBD 
(Q725 in hPR LBD) has hydrogen bonds to two other residues (V685 O and F764 
O in hAR LBD, 1699 O and F777 O in hPR LBD) and in hAR LBD it is 
5 hydrogen bonded to a further water molecule, the overall hydrogen bond 
geometry being nearly tetrahedral. In the hPR LBD - R1881 structure, the 
ligands in molecules A and B possess slightly different hydrogen bond patterns. 
In molecule A, 03 of R1881 forms two hydrogen bonds (3.2 A to Gin 725 N 82 
and 2.9 A to Arg 766 N* 12 ). One water molecule was located in the F 0 -F c electron 

10 density with the same tetrahedral geometry as observed in the hAR LBD - Rl 88 1 
structure. In molecule B, the ligand is in a slightly different position and the 
hydrogen bond pattern differs from that observed in molecule A. The 03 of 
R1881 forms again one hydrogen bond to Arg 766 ISP 2 with a distance of 2.9 A 
whereas the distance to Gln725 N e2 is now 3.7 A, outside the acceptable range for 

15 a hydrogen bond. 

The 17 P hydroxyl group of R1881 forms different hydrogen bonds, when bound 
to hAR LBD or hPR LBD (Figure 4). In hAR LBD, the 17 p hydroxyl group is 
hydrogen-bonded to Asn 705 (2.8 A) and Thr 877 (2.9 A). The same pattern is 

20 observed in molecule B of hPR LBD - R1881 complex where the 17p hydroxyl 
group of R1881 also forms strong interaction to Asn 719 (2.8 A), whereas in 
molecule A the corresponding distance of 3.5 A is only in the range of a weak 
interaction. In contrast to the hAR LBD, in both hPR LBD monomers Cys 891 
(Thr877 in hAR LBD) shows only a weak interaction with the 17p hydroxyl 

25 group of R1881 (3.7 A in molecule A and of 4.0 A in molecule B, respectively). 
However, the relative orientation of the Cys 891 side chain with regard to the 
hydroxyl group does suggest that this interaction is relevant to the binding of the 
ligand. 
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Structural basis for ligand specificity in hAR LBD 

The ligand R1881 binds with a relative binding affinity (RBA) of 290 to the wild- 
type hAR as compared to a value of 180 for DHT and 100 for testosterone, 
respectively [Teutsch, 1994]. As for the wild-type hPR, the relative binding 
affinity of R1881 is 190 with respect to progesterone (RBA = 100). Overall, 
R1881 shows comparable good binding affinities to both receptors, which is also 
reflected in the orientation of the ligand in the LBPs of the hAR LBD and the 
hPR LBD (Figure 4). Thr 894 in hPR LBD is replaced by Leu 880 in hAR LBD 
and the C 82 atom of this leucine makes a van der Waals contact (3.9 A) with the 
oxygen atom of the 17p hydroxyl group of R1881. This bulkier side chain, along 
with the substitution of Cys 891 in hPR LBD by Thr 877 in hAR LBD is very 
likely responsible for the specific recognition of the 17p hydroxyl group of 
R1881 contrary to the 17(3 acetyl group of progesterone. Not only there is an 
extra polar residue (Thr 877 besides Asn 705 which is conserved in AR) which 
can form an additional hydrogen bond to the 17(3 hydroxyl oxygen, but the 
directed decrease in pocket volume caused by the change of Thr894 to Leu880 
will very likely inhibit the binding of other bulkier ligands such as progesterone. 
As previously noted [Williams, 1998] there are no strong hydrogen-bonded 
interactions between the O20 carbonyl oxygen atom of progesterone and the 
protein in hPR LBD indicating that the recognition of this group is probably 
made only through hydrophobic and steric interactions. The hPR LBD can bind 
R1881 as well as progesterone and, as seen from the above discussion of the 
hydrogen bonding and van der Waals interaction pattern between protein chain 
and ligand in the crystal structure, the hPR LBD molecule appears to exhibit two 
different binding modes for R1881, one resembling that of progesterone (03 with 
two hydrogen bonds to the protein chain and the 170 function weakly interacting 
with the protein chain) and one similar to that of hAR LBD (03 with only one 
hydrogen bond to the protein chain and the 17(3 function also hydrogen bonded to 
the protein chain). However, these binding modes do not seem to imply 
significant changes in ligand position and orientation within the LBP. 
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Mutations 

We analysed whether the mutated amino acid residues are predominantly found 
in the interior of the protein or at the surface. Comparison of the solvent 
accessibility of these residues revealed that a nearly even distribution is found 
between buried, medium or fully accessible residues. Table 3 lists all those 
mutations in or near the AR ligand binding pocket (LBP) which gjre known to be 
involved in AIS and prostate cancer (PC), their location with respect to secondary 
structural elements as well as the potential effect of the mutations. 
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Table 3: hAR LBD mutations observed in prostate cancer, CAIS and 
PAIS/MAIS. For convenience, the equivalent positions of the amino acid 
residues (aar) in the hPR LBD are given. Bold numbers indicate available mutant 
data in the PR. All mutations are taken from the androgen receptor gene 
5 mutations data base (Gottlieb et al. 1998 and references therein) 
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Mutations are reported for 12 of the 18 residues considered to interact with the 
ligand R1881 within 4.0 A as discussed above, as well as two additional residues 
within 5.0 A of the ligand (G708 and V746 in hAR LBD, G722 and Val760 in 
hPR LBD). In some cases the same amino acid can be mutated into different 
5 residues, e.g. T877A and T877S. For most of these mutations, a structural effect 
can be associated with the substitution. For example, when Met 749 in hAR LBD 
is substituted by the branched amino acid valine, the C Y side chain atoms would 
become too close to the ligand. The location of these mutations in the three- 
dimensional structure of hAR LBD - R1881 is shown in Figure 5, and it can be 
10 seen that the mutations involved in the prostate cancer (PC) cluster mainly near 
the R1881 17p hydroxyl group while those involved in AIS are arranged mainly 
around the other parts of the ligand. One notable exception is Met 749 which has 
mutations implicated in both PC and CAIS and is located in the vicinity of R1881 
03 5 opposite from the other PC-implicated mutations. 

15 

Mutations in the LBP observed in the prostate cancer cell line LNCaP 

The prostate tumor cell line LNCaP contains an AR receptor showing a 
significant increased binding affinity for gestagenic and estrogenic steroids but 

20 shows identical R1881 binding (Veldscholte et al. 1990). A single point mutation 
(T877A) is associated with this abnormal behaviour. With an alanine at this 
position an important hydrogen bond partner for the 17p hydroxyl group in 
R1881, testosterone or dihydrotestosterone (DHT) would be missing, but the 
other hydrogen bond partner, Asn 705 , involved in ligand binding could still 

25 orient the ligand in the LBP. Mutagenesis experiments of hPR emphasised the 
critical role of this asparagine residue in ligand interaction (Letz et al 1999). In 
the crystal structure of the hPR LBD - progesterone complex, Cys 891 is found at 
the position of Thr 877, but no hydrogen bond of the 17p acetyl group of 
progesterone was observed although Cys 891 is relatively close (4.3 A in 

30 molecule A, 4.4 A in molecule B) to O20 of progesterone. However, bacterial 
extracts of a mutated hPR LBD (C891S or C891V) showed a large decrease in 
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relative binding affinity for progesterone and the purified mutated hPR LBD was 
completely inactive in binding assays [Letz, 1999]. 



Mutations in the LBP observed in CAIS 

5 

The three mutations in the hAR LBP described for CAIS are substitutions that 
considerably change the size of the respective amino acid side chains, N705S 
[Bellis, 1992; Pinsky, 1992], L707R [Lumbroso, 1996] and M749V [Bellis, 1992; 
Jakubicza, 1992]. This change in size alters the LBP such that the local structure 
10 and interactions to the ligand are disturbed. 

In the AR LBD and PR LBD crystal structures, Asn 705 or Asn 719 respectively 
is one of the hydrogen bond partners to the ligand R1881, but not to 
progesterone. If this residue is substituted to Val in hPR LBD, only a moderate 

15 effect was observed on the binding activity of progesterone, considering the Kd 
and half-life values [Letz, 1999]. In the crystal structure of the hPR LBD - 
progesterone complex, Asn 719 is involved in the stabilisation of the loop 
between HI 1 and H12, via hydrogen bond between Asn 719 N 52 and Glu 904 O. 
In the hAR LBD, an identical stabilisation is found, by means of a hydrogen bond 

20 between Asn 705 N 82 and Asp 890 O. A N705S mutation, observed in a patient 
suffering from CAIS would have a two-fold effect, destablilization of the 
structure and loss of a hydrogen bond partner for the ligand. 



In the described hAR mutant L707R, the structure integrity disturbance is also 
25 reflected in the binding constants. Considering a van der Waals cutoff distance of 
4.0 A, the side chain of Leu 707 makes close contacts with the A-ring of R1881 
as well as five residues in the protein chain: V685, A687, Q71 1, F764 and L768. 
The first two residues are located in a loop region between H2 and H3, the third 
is located within H3 and is involved in the hydrogen bonding pattern of a water 
30 molecule near the 03 atom of Rl 881 , and the final two belong to each of the two 
strands SI and S2 of the first short p-sheet. Clearly, such a variation in the size of 
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the side-chain would have a large impact, not only in the LBP but in disrupting 
the overall protein fold itself. The mutated receptor shows undetectable binding 
affinity to the ligand R1881 as obtained by Scatchard plot analysis and no 
transcriptional activity is found [Lumbroso, 1996]. 

5 

Mutations in the LBP observed with PAIS/MAIS 

Seven described mutations in the hAR LBP are associated with PAIS/ MAIS, and 
multiple substitutions were observed for amino acids at position 708 [Albers, 

10 1997 ] and 742 [Bevan, 1996 ]. In the hAR LBD crystal structure, a substitution 
of Gly 708 to alanine should be tolerated whereas a valine at this position would 
interfere with ligand binding. The closest distance of the C atom of an alanine 
residue to the ligand would be 3.0 A, however, the Cg atoms of a valine would be 
too close to the ligand atoms (1.5 A). The substitution of the equivalent Gly 722 

15 in the hPR receptor to serine does not influence the binding of agonists, but rather 
that of the antagonist RU486 [Benhamou, 1992]. 

In all steroid receptors, the steroid is stabilised by a hydrogen bond between the 
A-ring of the ligand and an arginine (Arg 752 in hAR). A smaller amino acid 

20 residue at this position (mutation to glutamine in hAR) should have a dramatic 
impact on ligand binding as the stabilisation of the A-ring would be severely 
hampered due to the lack of a electrostatic interaction (Cabral et al 1998, 
Komori, 1998). A similar effect has been reported for the hPR receptor where a 
mutation (R766H) resulted in a low or even non-detectable binding affinity. The 

25 side-chain of histidine is too small to serve as a hydrogen bond partner to the 03 
atom in progesterone [Letz, 1999]. 

In the hAR mutation F764S, R1881 shows a similar binding affinity as the wild 
type receptor, but a rapid ligand dissociation is observed [Marcelli, 1994]. In the 
30 crystal structure, Phe 764 is involved in the stabilisation of the A-ring position. A 
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smaller amino acid like serine would allow binding of the ligand, but very likely 
not contribute to the tight binding of R1881. 

Mutations M742V or M742I both dramatically reduce the binding affinity of 
5 R1881 [Bevan, 1996]. Although He and Val fit into the LBP, the changed 
environment is less tightly packed and the LBP is enlarged, thus affecting the 
binding of the ligand. 

However, not all mutations can be related to a disturbance of the structure. In 
10 case of the M787V mutation in the hAR LBD, it was found by Scatchard analysis 
that R1881 and DHT binding was undetectable or strongly reduced [Nakao, 
1992]. The lack of androgen binding was thought to be the cause for AIS. In the 
crystal structure, a methionine to valine substitution could be tolerated. The lack 
of binding affinity found for R1881 may account for a destabilisation in the LBP 
15 as the Met 787 side chain is in van der Waals contact with other amino acids like 
Val 760 and Leu 887 as well as ligand atoms. 

Example 4 

20 

Modified method for isolating hPR-LBD 
Purification ofhPR LBD with R1881: 

25 The pGEX-KG-hPR LBD construct rather than the pGEX~KG- hAR LBD 
construct was used for fermentation. As a result, compared to "normal" hPR 
LBD purification, there were a few differences at the beginning of the 
purification procedure. These differences were related to the size of the construct 
and to different pH values, salt and additive concentrations: 

30 
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Construct "normal" hPR LBD purification: pGEX-2T-hPR LBD construct (Gly-hPR LBD 

677-933), this time: pGEX-KG-hPR LBD: (GSPGISGGGGGI-hPR LBD 678-933) 
(N-terminal end extended by 10 residues). 

pH Reduction from pH 8.0 to pH 7.3 (instead of pH 7.5) 

NaCl Increase from 200 to 300 mM 

EDTA Increase from 0.5 to 5 mM 

DTT Increase from 5 to 10 mM 

R1881 100 |liM on lysis and binding to glutathione sepharose column 

Urea Reduction from 2 M to 0 M (purification without urea!) 

Results 4 



Purification was successful and the protein was concentrated to 3 mg/ml (total 
5 protein 1 .0 mg after SDS PAGE 

Example 5 

HAR-LBD-Ligand complexes 

10 

Energy minimisation calculations were performed with the ligands R1881, 
testosterone and 19Nor-testosterone. In a first step the protocols used in the 
calculations were optimized such that the energy minimisation calculation of the 
hAR LBD — R1881 complex reproduced the interactions between the protein and 
15 the ligand as observed in the crystal structure of the same complex especially the 
hydrogen bond partners of the 03 and 017 atoms of the ligand with the protein, 
i.e. Arg752 ? Gln711 and Asn705. Then the same protocols were used for the 
calculations of the hAR LBD - testosterone and the hAR LBD-19Nor- 
testosterone complexes. 
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5 Results 5 

The results of the energy minimisation calculations confirm the hydrogen bond 
interactions at atom 03 of both testosterone and 19Nor-testosterone as observed 
in the crystal structure between R1881 and the hAR LBD (with Arg752 and 
10 GLnVll). However, the interaction partners of the 017 atom at the D-ring are 
different due to the methyl substituent attached to position 10 of the steroid 
skeleton (position 19). 

In case of the ligand 19Nor-testosterone, the 017 atom interacts with the side 
15 chain of Asn705. The calculations of the hAR LBD in complex with the ligand 
testosterone showed a shift in the orientation of the ligand in the ligand binding 
pocket (LBP) most likely due to the presence of the methyl group attached to 
position 10 of the steroid scaffold. Here, an interaction of the Ol 7 atom with the 
side chain of Thr877 is observed in the calculations. The methyl group at that 
20 position in the ligand would be too close to amino acid residues Trp741 and 
Met745. In order to accommodate this ligand in the LBP, the ligand is shifted as 
well as the side chains of the amino acid residues Trp741 and Met745. 

The amino acid residues of the hAR LBD within a radius of 4 A around the 
25 respective ligands are the same for R1881 and 19Nor-testosterone. Due to the 
slight shift of testosterone of about 1.5 A in the D-ring area, amino acid residues 
Trp741 and Ile899 are now farer away from testosterone. 
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SUMMARY 

A crystal comprising an androgen receptor ligand binding domain (AR-LBD) is 
5 provided. The crystal structures of the human Androgen Receptor (hAR) in 
comparison with the human Progesterone Receptor (hPR) Ligand Binding 
Domains (LBDs) in complex with the same ligand metribolone (R1881) is also 
provided. The three-dimensional structures of the hAR LBD as well as the hPR 
LBD show the typical nuclear receptor fold. The change of two residues in the 

10 ligand binding pocket (LBP) between hPR and hAR was identified as the most 
likely source for the specificity of the R1881 ligand binding to hAR LBD. The 
AR-LBD amino acid residues are Leu 880 and Thr 877. The corresponding PR 
amino acid residues Thr894 and Cys891. In addition, there are three other amino 
acid changes which maybe involved in binding of ligands other than Rl 88 1 . The 

15 AR amino acid residues are Gin 783, Met 749 and Phe 876. The PR amino acid 
residues are Leu 797, Leu 763 and Tyr 890. The structural implications of the 14 
known mutations in the LBP of the hAR LBD associated with either prostate 
cancer or the partial or complete androgen receptor insensitivity syndrome were 
analysed. The effects of most of these mutants could be explained on the basis of 

20 the crystal structure. 

In one aspect, the present invention provides a method of identifying a compound 
that modulates (ie increases or decreases) AR activity, comprising: modeling test 
compounds that fit spatially into an AR LBD of interest using a model of the 

25 AR-LBD or portion thereof, screening the test compounds in an assay, for eg, a 
biological assay, characterised by binding of a test compound to the LBD and 
identifying a test compound that modulates AR activity wherein the structural 
model comprises structural co-ordinates of the LBD amino acid residues: L701; 
L704; N705; L707; Q711; M742; L744; M745; M749; R752; F764; Q783; 

30 M787; F876; T877; L880; F891 ; M895 or a homologue thereof. 
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In another aspect, the present invention relates to a computer readable medium 
having stored thereon a model of a crystal comprising an LBD structure of the 
AR-LBD. 

5 In a further aspect, the present invention relates to a computer readable medium 
having stored thereon a model of a crystal comprising an AR-LBD wherein said 
model is built from all or part of the X-ray diffraction data shown in Table 1 
and/or Table 2. 

10 In an even further aspect, there is provided the use of the structural co-ordinates 
provided in Table 4 for the identification of a ligand or for building a crystal 
structure for an AR-LBD. 

In another aspect, the present invention relates to a computer controlled method 
15 for designing a ligand capable of binding to the AR receptor comprising: 

(i) providing a model of the crystal structure of the AR-LBD; 

(ii) analysing said model to design a ligand which binds to the LBD; and 

(iii) determining the effect of said ligand on said AR-LBD. 

20 In a further aspect, there is provided a machine-readable data storage medium, 
comprising a data storage material encoded with machine readable data which, 
when using a machine programmed with instructions for using said data, is 
capable of displaying a graphical three dimensional representation of a crystal or 
a homologue of said crystal. 

25 

The present invention also provides a computer comprising such a storage 
medium. 

The present invention also provides the use of such a computer in an industrial 
30 context, such as identifying putative ligands. 
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In another aspect, there is provided a method for homology modelling a crystal 
comprising an AR-LBD or a homologue thereof comprising: 

(i) aligning the sequence of the AR-LBD (SEQ ID No 1 or SEQ ID No 2) or 
an AR-LBD homologue with the AR-LBD sequence and incorporating this 

5 sequence into the AR-LBD model; 

(ii) subjecting a preliminary AR-LBD model to energy minimisation resulting 
in an energy minimised model; 

(iii) remodeling the regions of said energy minimised model where 
stereochemistry restraints are violated; and 

10 (iv) obtaining a final homology model. 



Various modifications and variations of the described methods and system of the 
invention will be apparent to those skilled in the art without departing from the 
scope and spirit of the invention. Although the invention has been described in 

15 connection with specific preferred embodiments, it should be understood that the 
invention as claimed should not be unduly limited to such specific embodiments. 
Indeed, various modifications of the described modes for carrying out the 
invention which are obvious to those skilled in chemistry or biology or related 
fields are intended to be covered by the present invention. All publications 

20 mentioned in the above specification are herein incorporated by reference. 
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CLAIMS 

1. A crystal comprising an androgen receptor ligand binding domain (AR- 
LBD). 

5 

2. A crystal comprising a ligand binding domain (LBD) wherein the LBD is 
arranged in an a-helical sandwich comprising preferably the a-helices: HI, H3 ? 
H4, H5, H6, H7, H8, H9, H10, Hll and H12; preferably two 3i 0 helices; and 
preferably four short p strands (Sl 5 S2, S3 and S4) associated in two anti-parallel 

10 p-sheets; 

wherein the helices H4, H5, H10 and HI 1 are preferably contiguous helices; and 
wherein 

15 

either helix H6 is preferably an a-helix and/or 

helix HI 2 comprises preferably two helical segments of preferably 9 amino acid 
residues and preferably 5 amino acid residues. 

20 

3 . A crystal according to claim 2 wherein the LBD is an AR-LBD. 

4. A crystal according to any one of claims 1-3 wherein the LBD is a human 
AR-LBD. 

25 

5. A crystal according to any one of claims 1-4 wherein the LBD comprises 
the sequence presented as SEQ ID No 1 or a homologue or a mutant thereof. 

6. A crystal according to any one of the preceding claims wherein the LBD 
30 comprises the secondary structure presented as SEQ ID No 2 or a homologue 

thereof 
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7. A crystal comprising a ligand binding pocket (LBP); wherein the LBP is 
defined by the following amino acid residue structural co-ordinates: L701; L704; 
N705; L707; Q71 1; M742; L744; M745; M749; R752; F764; Q783; M787; F876; 

5 T877; L880; F891; M895; or a homologue thereof. 

8. A crystal comprising an LBP wherein the LBP is defined by a mutation or 
substitution or derivatisation in or of any one or more of the structural co- 
ordinates of the LBD amino acid residues as defined in claim 7. 

10 

9. A crystal according to claim 8 wherein the mutation is selected from the 
group consisting of any one or more of: L701H; M749I; T877A; T877S; L880Q; 
F891L;N705S; L707R; M749V; G708A; G708V; M742V; M742I; M745T; 
V746M; R752Q; F764S; M787V; or a homologue thereof 

15 

10. A crystal according to any one of the preceding claims wherein the crystal 
belongs to the space group P2j ? 2\, 2\, and having the unit dimensions a = 
58.28A, b = 66. 14A, c = 71.72A. 

20 11. A crystal according to any one of the preceding claims wherein the crystal 
further comprises a ligand bound to the LBD or a portion thereof 

12. A crystal according to claim 11 wherein the ligand is metribolone 
(R1881) or a mimetic thereof 

25 

13. A method of screening for a ligand capable of binding to a LBD wherein 
the method comprises the use of a crystal according to any one of claims 1-12. 

14. A method for screening for a ligand capable of binding to a LBD wherein 
30 the LBD is defined in claim 2 and/or claim 3 and/or claim 4 and/or claim 7 and/or 
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claim 8; the method comprising contacting the LBP with a test compound, and 
determining if said test compound binds to said LBP. 

15. A method according to claim 14 wherein the method is to screen for a 
5 ligand useful in the prevention and/or treatment of an androgen related disorder 

wherein the androgen related disorder is selected from the group consisting of 
androgen insensitivity syndrome (AIS), partial androgen insensitivity syndrome 
(PAIS), mild androgen insensitivity syndrome (MAIS), complete androgen 
insensitivity syndrome (CAIS) and prostrate cancer (PC). 

10 

16. A process comprising the steps of: 

(a) performing the method according to claim 13 or claim 14 or claim 15; 
15 (b) identifying one or more ligands capable of binding to a LBD; and 

(c) preparing a quantity of those one or more ligands. 

17. A process comprising the steps of: 

20 

a) performing the method according to claim 13 or claim 14 or claim 15; 

(b) identifying one or more ligands capable of binding to a LBD; and 

25 (c) preparing a pharmaceutical composition comprising those one or more 
identified ligands. 

18. A process comprising the steps of: 

30 (a) performing the method according to claim 1 3 or claim 14 or claim 15; 
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(b) identifying one or more ligands capable of binding to a LBD; and 

(c) modifying those one or more identified ligands capable of binding to a 
LBD; and 

5 

(d) performing said method according to claim 13 or claim 14 or claim 15; and 

(d) optionally preparing a pharmaceutical composition comprising those one or 
more modified ligands. 

10 

19. A ligand identified by the method of claim 13 or claim 14 or claim 15 
wherein the ligand is a LBD binding compound. 

20. A ligand according to claim 19 wherein the ligand is capable of interacting 
15 with a LBD region located in helices H4 and H5 of the LBD. 

21. A ligand according to claim 19 wherein the ligand is capable of interacting 
with one or more of: Asn 705, Met 749, Gin 783, Phe 876, Thr 877, Leu 880 of 
an AR-LBD. 

20 

22. A ligand according to claim 21 wherein the ligand is capable of interacting 
with one or more of: Met 749, Gin 783, Phe 876, Thr 877, Leu 880 of an AR- 
LBD. 

25 23. A ligand according to claim 21 or claim 22 wherein Hie ligand is capable of 
interacting with one or more of: Thr 877, Leu 880 of an AR-LBD. 

24. A ligand according to claim 19 wherein the ligand is capable of interacting 
with Asn 705. 

30 
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25. A ligand according to claim 19 wherein the ligand is capable of fitting 
spatially into a LBP wherein the LBP is defined by the structural co-ordinates of 
the mutated amino acid residues L701H; M749I; T877A; T877S; L880Q; 
F891L;N705S; L707R; M749V; G708A; G708V; M742V; M742I; M745T; 

5 V746M; R752Q; F764S; M787V, or a homologue thereof. 

26. A pharmaceutical composition comprising a ligand according to any one of 
claims 21-25 and a pharmaceutically acceptable carrier, diluent, excipient or 
adjuvant or any combination thereof. 

10 

27. A method of preventing and/or treating an androgen related disorder 
comprising administering an ligand according to any one of claims 21-25 and or a 
pharmaceutical according to claim 26 wherein said agent or said pharmaceutical is 
capable of modulating an AR-LBD to cause a beneficial preventative and/or 

15 therapeutic effect. 

28. A method according to claim 27 wherein the androgen related disorder is 
that defined in claim 15. 

20 29. Use of a ligand according to any one of claims 21-25 in the preparation of a 
pharmaceutical composition for the treatment of an androgen related disorder. 

30. Use of a crystal comprising an AR-LBD in the preparation of a medicament 
to prevent and/or treat androgen related disorders. 

25 

31. Use according to claim 30 wherein the AR-LBD is used to screen for 
ligands that can modulate the activity of the AR-LBD. 



32. An AR-LBD agonist, wherein the AR-LBD is that defined in any one of 
30 claim 1 and/or claim 3 and/or claim 4. 
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33. An AR-LBD antagonist wherein the AR-LBD is that defined in any one of 
claim 1 and/or claim 3 and/or claim 4. 

34. A crystal comprising an androgen receptor ligand binding pocket (AR- 
5 LBP). 

3 5 . An AR-LBD in a crystal form. 

36. A method for predicting, simulating or modelling the molecular 
10 characteristics and/or molecular interactions of a ligand binding domain (LBD) 

comprising the use of a computer model, said computer model comprising, using, 
or depicting the structural coordinates of a ligand binding domain as provided in 
Table 4 or Table 5 to provide an image of said binding ligand domain and to 
optionally display said image. 

15 

37. A method according to claim 36 wherein said method further comprises the 
use of a computer model comprising, using, or depicting the structural coordinates 
of a ligand to provide an image of said ligand and to optionally display said image. 

20 38. A method according to claim 37 wherein said method further comprises 
providing an image of said ligand in association with said LBD and optionally 
displaying said image. 

39. A method according to claim 38 wherein said ligand is manufactured and 
25 optionally formulated as a pharmaceutical composition. 

40. A crystal substantially as described herein and with reference to the 
accompanying Figures. 
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coordinates of hAR LBD in complex with the ligand R1881 
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77 


O 


ATOM 


2552 


OW 


WAT 


1012 


7 


.032 


36 


.581 


14 


.890 


1. 


00 


40. 


54 


O 


ATOM 


2555 


OW 


WAT 


1013 


18 


.090 


34 


.834 


6 


.262 


1. 


00 


66. 


21 


O 


ATOM 


2558 


OW 


WAT 


1014 


5 


.741 


29 


.774 


22 


.458 


1 . 


00 


37. 


52 


O 


ATOM 


2561 


OW 


WAT 


1015 


29 


. 879 


8 


.063 


10 


. 984 


1 . 


00 


53. 


83 


O 


ATOM 


2564 


OW 


WAT 


1016 


17 


.517 


11 


.963 


20 


.575 


1. 


00 


51. 


58 


O 


ATOM 


2567 


OW 


WAT 


1017 


7 


. 674 


37 


.499 


32 


.487 


1 . 


00 


46. 


92 


O 


ATOM 


2570 


OW 


WAT 


1018 


-4 


.737 


36 


.972 


-13 


.426 


1. 


00 


75. 


71 


O 


ATOM 


2573 


OW 


WAT 


1019 


1 


.207 


32 


.439 


14 


.426 


1 . 


00 


45. 


58 


O 


ATOM 


2576 


OW 


WAT 


1020 


-5 


.348 


34 


.883 


24 


.137 


1. 


00 


62. 


76 


0 


ATOM 


2579 


OW 


WAT 


1021 


10 


.790 


29 


.074 


3 


.632 


1. 


00 


61. 


00 


0 


ATOM 


2582 


OW 


WAT 


1022 


1 


.314 


20 


.835 


2 


.312 


1 . 


00 


46. 


40 


0 


ATOM 


2585 


OW 


WAT 


1023 


3 


.112 


18 


. 553 


3 


.147 


1 . 


00 


65. 


78 


0 


ATOM 


2588 


OW 


WAT 


1024 


27 


.562 


8 


. 424 


12 


.882 


1. 


00 


58. 


80 


0 


ATOM 


2591 


OW 


WAT 


1025 


26 


.453 


8 


.689 


17 


. 052 


1 . 


00 


48. 


49 


0 


ATOM 


2594 


OW 


WAT 


1026 


5 


.869 


40 


.798 


18 


.893 


1 . 


00 


48. 


79 


0 
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FIG. / (TABLE 5) 

Coordinates of hPR LBD in complex with R18 81 
Atom Molecule Nr. 





Nr. 


type 


(hPR= 


dimeric) 




X 




Y 




Z 












ATOM 


1 


N 


GLN 


A 


682 


20. 


497 


-20 


.8 62 


-23. 


305 


1. 


. 00 


68 . 


60 


N 


ATOM 


2 


CA 


GLN 


A 


682 


19. 


503 


-21 


.064 


-24. 


380 


1. 


.00 


69. 


91 


C 


ATOM 


3 


C 


GLN 


A 


682 


18. 


229 


-21 


. 697 


-23. 


807 


1 


. 00 


68 . 


48 


C 


ATOM 


4 


O 


GLN 


A 


682 


17. 


143 


-21 


. 664 


-2 4. 


347 


1, 


. 00 


69. 


90 


O 


ATOM 


5 


CB 


GLN 


A 


682 


20. 


020 


-21 


.875 


-25. 


570 


1, 


.00 


71. 


13 


c 


ATOM 


6 


N 


LEU 


A 


683 


18. 


368 


-22 


.287 


-22. 


64 0 


1 


.00 


66. 


32 


N 


ATOM 


7 


CA 


LEU 


A 


683 


17. 


309 


-22 


.526 


-21. 


704 


1 


.00 


62 . 


77 


c 


ATOM 


8 


C 


LEU 


A 


683 


17. 


316 


-21 


.477 


-20. 


609 


1, 


.00 


58 . 


97 


c 


ATOM 


9 


O 


LEU 


A 


683 


16. 


249 


-21 


.443 


-20. 


021 


1 


.00 


55. 


97 


O 


ATOM 


10 


CB 


LEU 


A 


683 


17. 


498 


-23 


.951 


-21. 


162 


1 


. 00 


65. 


72 


c 


ATOM 


11 


CG 


LEU 


A 


683 


16. 


409 


-24 


. 988 


-21. 


409 


1 


.00 


67. 


24 


c 


ATOM 


12 


CD1 


LEU 


A 


683 


15. 


077 


-24 


.271 


-21. 


643 


1 


.00 


68. 


14 


c 


ATOM 


13 


CD2 


LEU 


A 


683 


16. 


692 


-25 


.971 


-22. 


539 


1 


. 00 


68. 


46 


c 


ATOM 


14 


N 


ILE 


A 


684 


18. 


376 


-20 


.707 


-20. 


380 


1 


.00 


57 . 


68 


N 


ATOM 


15 


CA 


ILE 


A 


684 


18. 


409 


-19 


.708 


-19. 


292 


1 


.00 


54 . 


87 


c 


ATOM 


16 


C 


ILE 


A 


684 


17. 


729 


-18 


.482 


-19. 


894 


1 


.00 


51. 


45 


c 


ATOM 


17 


O 


ILE 


A 


684 


18. 


162 


-18 


.002 


-20. 


919 


1 


.00 


51. 


14 


O 


ATOM 


18 


CB 


ILE 


A 


684 


19. 


743 


-19 


.375 


-18. 


591 


1 


.00 


53. 


30 


c 


ATOM 


19 


CGI 


ILE 


A 


684 


20. 


378 


-20 


. 637 


-17. 


959 


1 


.00 


51. 


27 


c 


ATOM 


20 


CG2 


ILE 


A 


684 


19. 


679 


-18 


.330 


-17. 


498 


1 


.00 


51. 


15 


c 


ATOM 


21 


GDI 


ILE 


A 


684 


19. 


366 


-21 


.452 


-17. 


179 


1 


.00 


50. 


03 


c 


ATOM 


22 


N 


PRO 


A 


685 


16. 


614 


-18 


.156 


-19. 


.255 


1 


.00 


48. 


47 


N 


ATOM 


23 


CA 


PRO 


A 


685 


15. 


999 


-16 


.875 


-19. 


543 


1 


.00 


47 . 


52 


c 


ATOM 


24 


C 


PRO 


A 


685 


16. 


988 


-15 


.705 


-19. 


389 


1 


.00 


46. 


9 6 


c 


ATOM 


25 


O 


PRO 


A 


685 


17. 


783 


-15 


.551 


-18. 


444 


1 


.00 


46. 


61 


O 


ATOM 


26 


CB 


PRO 


A 


685 


14. 


799 


-16 


.934 


-18. 


638 


1 


.00 


46. 


42 


c 


ATOM 


27 


CG 


PRO 


A 


685 


14. 


983 


-17 


.997 


-17. 


635 


1 


.00 


44. 


94 


c 


ATOM 


28 


CD 


PRO 


A 


685 


16. 


393 


-18 


.437 


-17. 


794 


1 


.00 


46. 


07 


c 


ATOM 


29 


N 


PRO 


A 


686 


16. 


875 


-14 


.806 


-20. 


397 


1 


.00 


44 . 


87 


N 


ATOM 


30 


CA 


PRO 


A 


686 


17. 


893 


-13 


.787 


-20. 


63 6 


1 


.00 


42. 


14 


c 


ATOM 


31 


C 


PRO 


A 


686 


18. 


311 


-12 


.833 


-19. 


536 


1 


.00 


36. 


79 


c 


ATOM 


32 


0 


PRO 


A 


686 


19. 


517 


-12 


. 670 


-.19. 


427 


1 


.00 


34. 


82 


O 


ATOM 


33 


CB 


PRO 


A 


686 


17. 


4 67 


-13 


.130 


-21. 


943 


1 


. 00 


42. 


03 


c 


ATOM 


34 


CG 


PRO 


A 


686 


16. 


624 


-14 


.106 


-22. 


644 


1 


.00 


41. 


43 


c 


ATOM 


35 


CD 


PRO 


A 


686 


16. 


208 


-15 


.159 


-21. 


694 


1 


.00 


42. 


48 


c 


ATOM 


36 


N 


LEU 


A 


687 


17. 


428 


-12 


.322 


-18. 


719 


1 


.00 


32. 


38 


N 


ATOM 


37 


CA 


LEU 


A 


687 


17 . 


639 


-11 


.687 


-17. 


420 


1 


.00 


27. 


80 


c 


ATOM 


38 


C 


LEU 


A 


687 


18. 


448 


-12 


.520 


-16. 


455 


1 


. 00 


26. 


34 


c 


ATOM 


39 


0 


LEU 


A 


687 


19. 


516 


-12 


.105 


-16. 


135 


1 


.00 


23. 


70 


O 


ATOM 


40 


CB 


LEU 


A 


687 


16. 


262 


-11 


.320 


-16. 


857 


1 


.00 


20. 


46 


c 


ATOM 


41 


CG 


LEU 


A 


687 


16. 


218 


-10 


.278 


-15. 


763 


1 


.00 


15. 


03 


c 


ATOM 


42 


CD1 


LEU 


A 


687 


17. 


167 


-9 


.187 


-16. 


281 


1 


.00 


10. 


95 


c 


ATOM 


43 


CD2 


LEU 


A 


687 


14 . 


744 


-10 


.067 


-15. 


421 


1 


.00 


8. 


68 


c 


ATOM 


4 4 ' 


N 


ILE 


A 


688 


18. 


135 


-13 


.802 


-16. 


326 


1 


.00 


28. 


10 


N 


ATOM 


45 


CA 


ILE 


A 


688 


18. 


8 65 


-14 


.717 


-15. 


455 


1 


. 00 


27 . 


04 


c 


ATOM 


46 


C 


ILE 


A 


688 


20. 


251 


-14 


.728 


-16. 


063 


1 


.00 


28 . 


14 


c 


ATOM 


47 


O 


ILE 


A 


688 


21. 


106 


-14 


. 947 


-15. 


202 


1 


.00 


28 . 


63 


O 


ATOM 


48 


CB 


ILE 


A 


688 


18. 


238 


-16 


.104 


-15. 


281 


1 


.00 


22. 


58 


c 


ATOM 


49 


CGI 


ILE 


A 


688 


16. 


900 


-16 


.103 


-14. 


562 


1 


.00 


17 . 


70 


c 


ATOM 


50 


CG2 


ILE 


A 


688 


19. 


220 


-17 


.086 


-14. 


614 


1 


.00 


21. 


80 


c 


ATOM 


51 


CD1 


ILE 


A 


688 


16. 


041 


-17 


.279 


-14. 


435 


1 


.00 


10. 


65 


c 


ATOM 


52 


N 


ASN 


A 


689 


20. 


606 


-14 


.488 


-17. 


305 


1 


.00 


27 . 


48 


N 


ATOM 


53 


CA 


ASN 


A 


689 


21. 


989 


-14 


.555 


-17. 


661 


1 


.00 


29. 


03 


C 


ATOM 


54 


C 


ASN 


A 


689 


22. 


626 


-13 


.207 


-17. 


331 


1 


.00 


31. 


08 


C 


ATOM 


55 


0 


ASN 


A 


689 


23. 


821 


-13 


.104 


-17. 


132 


1 


.00 


28. 


02 


O 


ATOM 


56 


CB 


ASN 


A 


689 


22. 


326 


-14 


.844 


-19. 


077 


1 


.00 


30. 


30 


C 
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ATOM 


57 


CG 


ASN 


A 


689 


21. 


.783 


-16.016 


-19.851 


1. 


.00 


32, 


.25 


C 


ATOM 


58 


OD1 


ASN 


A 


689 


21. 


.438 


-15.913 


-21.064 


1, 


.00 


28 


.32 


O 


ATOM 


59 


ND2 


ASN 


A 


689 


21. 


.758 


-17.080 


-19.032 


1. 


.00 


31, 


.87 


N 


ATOM 


60 


N 


LEU 


A 


690 


21. 


.802 


-12.151 


-17 .384 


1 


.00 


33 


.58 


N 


ATOM 


61 


CA 


LEU 


A 


690 


22, 


.304 


-10.825 


-17.095 


1, 


.00 


31 


.59 


C 


ATOM 


62 


C 


LEU 


A 


690 


22. 


.532 


-10.821 


-15.589 


1 


.00 


30 


.27 


C 


ATOM 


63 


O 


LEU 


A 


690 


23 


.644 


-10.480 


-15.178 


1 


.00 


31 


.78' 


o 


ATOM 


64 


CB 


LEU 


A 


690 


21, 


.470 


-9.748 


-17.782 


1 


.00 


29 


.77 


C 


ATOM 


65 


CG 


LEU 


A 


690 


22, 


.071 


-8.314 


-17.755 


1 


.00 


28 


.94 


C 


ATOM 


66 


CD1 


LEU 


A 


690 


20 


.984 


-7.416 


-18.306 


1 


.00 


30 


.28 


C 


ATOM 


67 


CD2 


LEU 


A 


690 


22. 


.351 


-7 .693 


-16.425 


1 


.00 


24 


.76 


C 


ATOM 


68 


N 


LEU 


A 


691 


21 


.623 


-11.247 


-14 .731 


1 


.00 


26 


.17 


N 


ATOM 


69 


CA 


LEU 


A 


691 


21 


.914 


-11.479 


-13.339 


1 


.00 


23 


.77 


C 


ATOM 


70 


C 


LEU 


A 


691 


23 


.263 


-12.119 


-13.152 


1 


.00 


25 


. 57 


C 


ATOM 


71 


O 


LEU 


A 


691 


24 


.224 


-11.608 


-12.585 


1 


.00 


24 


.09 


0 


ATOM 


72 


CB 


LEU 


A 


691 


20 


.805 


-12.213 


-12.599 


1 


.00 


15 


.70 


C 


ATOM 


73 


CG 


LEU 


A 


691 


19 


.475 


-11.390 


-12.510 


1 


.00 


12 


.57 


C 


ATOM 


74 


CD1 


LEU 


A 


691 


18 


.532 


-12.076 


-11.475 


1 


.00 


9 


.03 


C 


ATOM 


75 


CD2 


LEU 


A 


691 


19 


.629 


-9.926 


-12.273 


1 


.00 


7 


.87 


C 


ATOM 


76 


N 


MET 


A 


692 


23 


.548 


-13.267 


-13.723 


1 


.00 


30 


.72 


N 


ATOM 


77 


CA 


MET 


A 


692 


24 


.853 


-13.889 


-13.687 


1 


.00 


34 


.02 


C 


ATOM 


78 


C 


MET 


A 


692 


25 


.930 


-12.827 


-13.804 


1 


.00 


35 


.93 


C 


ATOM 


79 


O 


MET 


A 


692 


26 


.623 


-12.545 


-12.860 


1 


.00 


37 


. 90 


o 


ATOM 


80 


CB 


MET 


A 


692 


25 


.294 


-14.924 


-14 .704 


1 


.00 


30 


. 65 


C 


ATOM 


81 


CG 


MET 


A 


692 


26 


.553 


-15.665 


-14.231 


1 


.00 


31 


.53 


C 


ATOM 


82 


SD 


MET 


A 


692 


26 


.431 


-16.866 


-12.866 


1 


.00 


22 


.64 


s 


ATOM 


83 


CE 


MET 


A 


692 


26 


.242 


-15.633 


-11.547 


1 


.00 


27 


.35 


C 


ATOM 


84 


N 


SER 


A 


693 


25 


.969 


-12.223 


-14.998 


1 


.00 


37 


.17 


N 


ATOM 


85 


CA 


SER 


A 


693 


27 


.066 


-11.375 


-15.437 


1 


.00 


32 


.44 


c 


ATOM 


86 


C 


SER 


A 


693 


27 


.235 


-10.197 


-14.533 


1 


.00 


28 


. 99 


c 


ATOM 


87 


O 


SER 


A 


693 


28 


.388 


-9.868 


-14.374 


1 


.00 


26 


. 65 


o 


ATOM 


88 


CB 


SER 


A 


693 


27 


.172 


-11.057 


-16.905 


1 


.00 


28 


.75 


c 


ATOM 


89 


OG 


SER 


A 


693' 


26 


.151 


-10.369 


-17.490 


1 


.00 


26 


. 83 


o 


ATOM 


90 


N 


ILE 


A 


694 


26 


.219 


-9. 671 


-13.923 


1 


.00 


28 


.58 


N 


ATOM 


91 


CA 


ILE 


A 


694 


26 


.419 


-8.388 


-13.206 


1 


.00 


26 


.72 


c 


ATOM 


92 


C 


ILE 


A 


694 


26 


.826 


-8.702 


-11 .780 


1 


.00 


29 


. 89 


c 


ATOM 


93 


o 


ILE 


A 


694 


27 


.420 


-7.804 


-11.175 


1 


.00 


31 


.94 


o 


ATOM 


94 


CB 


ILE 


A 


694 


25 


.270 


-7.388 


-13.267 


1 


.00 


20 


.16 


c 


ATOM 


95 


CGI 


ILE 


A 


694 


23 


.895 


-8.003 


-13.143 


1 


.00 


17 


.56 


c 


ATOM 


96 


CG2 


ILE 


A 


694 


25 


.348 


-6.758 


-14.660 


1 


.00 


17 


.15 


c 


ATOM 


97 


CD1 


ILE 


A 


694 


22 


.940 


-7.177 


-12.352 


1 


.00 


17 


.17 


c 


ATOM 


98 


N 


GLU 


A 


695 


26 


.602 


-9.933 


-11.326 


1 


.00 


31 


. 55 


N 


ATOM 


99 


CA 


GLU 


A 


695 


27 


.163 


-10.359 


-10.071 


1 


.00 


33 


. 65 


C 


ATOM 


100 


C 


GLU 


A 


695 


28 


.637 


-9.932 


-9. 904 


1 


.00 


35 


.24 


C 


ATOM 


101 


o 


GLU 


A 


695 


29 


.430 


-10.234 


-10.793 


1 


.00 


31 


.74 


0 


ATOM 


102 


CB 


GLU 


A 


695 


27 


.030 


-11.862 


-9.864 


1 


.00 


33 


.91 


c 


ATOM 


103 


CG 


GLU 


A 


695 


27 


.019 


-12.393 


-8.446 


1 


.00 


33 


. 67 


c 


ATOM 


104 


CD 


GLU 


A 


695 


25 


.821 


-11.964 


-7.599 


1 


.00 


33 


.90 


c 


ATOM 


105 


OE1 


GLU 


A 


695 


24 


.849 


-11.629 


-8.297 


1 


.00 


34 


.22 


0 


ATOM 


106 


OE2 


GLU 


A 


695 


25 


.842 


-11.998 


-6.336 


1 


.00 


34 


.61 


o 


ATOM 


107 


N 


PRO 


A 


696 


28 


.896 


-9.274 


-8.757 


1 


.00 


37 


.14 


N 


ATOM 


108 


CA 


PRO 


A 


696 


30 


.174 


-8.821 


-8.351 


1 


.00 


38 


.49 


c 


ATOM 


109 


C 


PRO 


A 


696 


31 


.264 


-9.846 


-8.296- 


1 


.00 


39 


.84 


c 


ATOM 


110 


o 


PRO 


A 


696 


30 


.955 


-10.950 


-7.908 


1 


.00 


40 


.66 


0 


ATOM 


111 


CB 


PRO 


A 


696 


29 


.912 


-8.420 


-6.881 


1 


.00 


38 


.71 


c 


ATOM 


112 


CG 


PRO 


A 


696 


28 


.489 


-8.164 


-6.660 


1 


.00 


37 


.76 


c 


ATOM 


113 


CD 


PRO 


A 


696 


27 


.821 


-8.879 


-7.801 


1 


.00 


37 


.06 


c 


ATOM 


114 


N 


ASP 


A 


697 


32 


.532 


-9.551 


-8.496 


1 


.00 


42 


.18 


N 


ATOM 


115 


CA 


ASP 


A 


697 


33 


.638 


-10.421 


-8.087 


1 


.00 


44 


.28 


c 


ATOM 


116 


C 


ASP 


A 


697- 


34 


.026 


-10.470 


-6.588 


1 


.00 


43 


.77 


c 


ATOM 


117 


O 


ASP 


A 


697 


33 


. 644 


-9.624 


-5.798 


1 


.00 


42 


. 94 


O 


ATOM 


118 


CB 


ASP 


A 


697 


34 


.835 


-10.151 


-9.023 


1 


.00 


44 


.65 


c 


ATOM 


119 


CG 


ASP 


A 


697 


35 


.964 


-11.138 


-8.728 


1 
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-2.012 15.802 1.00 15.79 N 

-0.685 16.435 1.00 19.87 C 

-0.577 17.820 1.00 20.73 C 

0.365 18.125 1.00 16.27 O 

-0.149 16.289 1.00 17.85 C 

1.317 16.821 1.00 19.82 C 

2.461 16.013 1.00 17.22 C 

1.591 17.163 1.00 19.67 C 

-1.586 18.691 1.00 20.06 N 

-1.571 19.992 1.00 20.07 C 

-1.795 19.824 1.00 20.53 C 

-1.211 20.663 1.00 20.20 O 

-2.665 20.957 1.00 17.51 C 

-2.654 21.361 1.00 14.61 C 
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-3.873 21.597 1.00 7.39 N 

-2.539 18.902 1.00 19.64 N 

-2.378 18.772 1.00 17.66 C 

-1.025 18.229 1.00 20.84 C 
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-3.394 17.814 1.00 12.65 C 

-4.660 18.126 1.00 9.56 O 

-3.434 17.693 1.00 9.83 C 
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0.952 16.938 1.00 25.64 C 

1.943 18.075 1.00 24.42 C 

2.162 18.373 1.00 21.66 O 

1.514 16.100 1.00 22.07 C 

2.723 15.354 1..00 16.70 C 

2^835 14.871 1.00 17.66 C 

3.727 15.137 1.00 17.61 C 
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3.933 14.181 1.00 19.61 C 

4.869 14.479 1.00 18.61 C 

5.007 13.991 1.00 18.04 C 

2.395 18.654 1.00 25.58 N 

3.028 19.915 1.00 27.82 C 

2.639 20.914 1.00 29.47 C 

3.606 21.366 1.00 26.02 O 

2.860 20.444 1.00 26.96 C 

3.566 19.446 1.00 26.08 C 

3.475 21.810 1.00 28.88 C 

3.158 19.693 1.00 26.13 C 

1.427 21.347 1.00 31.97 N 

1.148 22.214 1.00 33.91 C 

0.924 21.576 1.00 34.80 C 

0.304 22.140 1.00 34.42 O 

-0.030 23.137 1.00 32.68 C 

-0.045 23.479 1.00 33.94 C 

-1.188 23.606 1.00 35.82 C 

-0.954 23.197 1.00 37.08 O 

-2.410 24.091 1.00 33.32 N 

1.487 20.416 1.00 35.40 N 

1.129 19.575 1.00 34.27 C 

1.444 20.274 1.00 33.79 C 

0.854 20.004 1.00 34.20 O 

1.727 18.146 1.00 31.15 C 

2.792, 17.777 1.00 25.99 O 

2.427 21.112 1.00 36.10 N 

3.063 21.611 1.00 40.55 C 

2.387 22.899 1.00 41.58 C 

2.315 23.315 1.00 40.05 O 

4.504 22.045 1.00 43.30 C 

5.293 20.720 1.00 48.60 C 

5.125 20.014 1.00 50.16 C 

6.323 20.159 1.00 53.41 N 

7.480 19.590 1.00 54.64 C 

7.838 18.397 1.00 52.95 N 

8.112 20.532 1.00 55.40 N 

1.975 23.506 1.00 41.09 N 

1.012 24.594 1.00 40.63 C 

-0.214 23.995 1.00 40.93 C 

-0.513 24.210 1.00 41.65 O 

0.667 25.010 1.00 38.87 C 

-0.744 23.046 1.00 39.67 N 

-1.942 22.307 1.00 35.57 C 

-1.991 21.554 1.00 36.49 C 

-3.136 21 .292 1.00 38.81 O 

-2.198 21.381 1.00 29.43 C 

-2.570 22.328 1.00 26.15 C 

-2.752 21.858 1.00 20.50 C 

-3.810 23.145 1.00 25.71 C 

-0.904 21.173 1.00 36.40 N 

-0.944 20.363 1.00 36.06 C 

-1.344 18.920 1.00 36.65 C 

-1.882 18.264 1.00 35.82 O 

-2.005 20.995 1.00 30.77 C 

-1.370 21.588 1.00 29.24 O 

-1.237 18.444 1.00 36.56 N 

-1.627 17.089 1.00 36.56 C 

-0.393 16.187 1.00 34.08 . C 

0.425 16.359 1.00 31.56 O 

-2.317 17.225 1.00 36.94 C 

-2.627 15.836 1.00 38.13 C 

-3.479 18.179 1.00 35.50 C 

-0.250 15.258 1.00 29.98 N 
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9.547 32.272 1.00 13.34 O 

6.629 30.891 1.00 2.34 C 

6.159 32.211 1.00 9.31 O 

8.443 33.089 1.00 13.69 N 

9.213 34.281 1.00 13.54 C 

10.705 34.021 1.00 18.14 C 

11.585 34.529 1.00 23.09 O 

8.655 35.139 1.00 9.26 C 

7.352 35.874 1.00 2.69 C 

6.778 36.536 1.00 2.36 C 
7.472 36.635 1.00 2.02 C 

11.225 33.336 1.00 19.45 N 

12.576 32.920 1.00 19.45 C 
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13,267 32.161 1.00 20.34 C 

14.428 32.318 1.00 20.42 O 

12.577 32.044 1.00 19.55 C 

13.325 32.741 1.00 18.07 C 

12.729 32.010 1.00 17.44 S 

11.129 32.863 1.00 20.66 C 

12.467 31.266 1.00 19.89 N 

12.937 30.576 1.00 19.54 C 

13.053 31.450 1.00 20.87 C 

14.147 31.607 1.00 20.84 O 

12.234 29.236 1.00 17.03 C 

11.353 29.332 1.00 15.19 C 

13.391 28.240 1.00 17.09 C 

12.028 32.153 1.00 21.93 N 

12.135 33.148 1.00 21.91 C 

13.275 34.136 1.00 22.60 C 

13.690 34.568 1.00 21.89 O 

10.832 33.974 1.00 15.40 C 

10.622 34.469 1.00 12.40 C 

10.699 33.565 1.00 2.42 C 

10.473 35.833 1.00 9.72 C 

10.423 34.148 1.00 6.31 C 

10.302 36.375 ' 1.00 6.59 C 
10.246 35.520 1.00 6.18 C 
13.777 34.568 1.00 23.27 N 
14.817 35.550 1.00 20.64 C 

16.168 34.937 1.00 20.81 C 
17.175 35.438 1.00 21.06 O 
16.292 33.805 1.00 22.59 N 
17.434 32.947 1.00 20.23 C 
17.605 32.807 1.00 20.66 C 
18.617 33.218 1.00 17.57 O 
17.285 31.606 1.00 17.31 C 

18.169 30.483 1.00 13.88 C 
19.671 30.816 1.00 12.20 C 
17.923 29.199 1.00 9.12 C 
16.605 32.296 1.00 20.53 N 
16.655 32.420 1.00 21.67 C 
17.413 33.695 1.00 23.47 C 
18.312 33.650 1.00 22.47 O 
16.920 34.885 1.00 21.96 N 
17.129 36.096 1.00 18.83 C 
18.609 36.371 1.00 20.15 C 

19.303 36.631 1.00 20.30 O 
16.321 37.278 1.00 15.03 C 
16.756 38.612 1.00 8.19 C 
17.379 39.598 1.00 7.66 C 
16.597 39.075 1.00 7.48 C 
17.637 40.661 1.00 6.70 N 
17.146 40.320 1.00 7.27 C 
15.975 38.575 1.00 7.50 C 
17.105 41.071 1.00 6.67 C 
16.049 39.294 1.00 2.50 C 
16.529 40.562 1.00 5.49 C 
18.984 36.352 1.00 22.34 N 
20.267 36.582 1.00 22.53 C 
21.253 35.624 1.00 23.52 C 
22.440 35.885 1.00 26.07 O 
20.346 36.193 1.00 17.76 C 
20.640 37.370 1.00 14.97 C 
20.775 37.123 1.00 14.11 C 
20.094 36.123 1.00 14.54 N 
18.844 35.664 1.00 13.59 C 
17.982 36.279 1.00 11.42 N 
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18.372 34.707 1.00 13.13 N 

20.920 34.426 1.00 23.57 N 

21.956 33.506 1.00 24.17 C 

22.306 33.822 1.00 29.29 C 

23.477 33.894 1.00 28.86 O 

21.596 32.083 1.00,17.40 C 

22.179 32.028 1.00 7.81 O 

21.225 34.075 1.00 31.05 N 

21.303 34.585 1.00 31.41 C 

21.831 35.989 1.00 31.48 C 

22.477 36.125 1.00 33.09 O 

19.977 34.447 1.00 32.03 C 

19.637 35.381 1.00 34.81 C 

20.074 35.047 1.00 34.01 C 

18.879 36.565 1.00 33.99 C 

19.772 35.826 1.00 34.10 C 

18.594 37.332 1.00 33.94 C 

19.033 36.978 1.00 33.60 C 

18.725 37.744 1.00 32.52 O 
21.588 36.979 1.00 32.32 N 
22.347 38.211 1.00 36.21 C 
23.868 38.198 1.00 37.30 C 
24.527 38.881 1.00 38.38 O 
21.886 39.313 1.00 35.58 C 
20.736 40.138 1.00 33.46 C 
20.877 41.551 1.00 32.31 C 
21.714 42.613 1.00 30.90 C 
21.002 43.968 1.00 26.65 N 
24.311 37.640 1.00 35.47 N 
25.610 37.560 1.00 34.20 C 
26.564 36.448 1.00 34.17 C 
27.774 36.577 1.00 34.57 O 
25.376 37.326 1.00 34.51 C 
24.826 38.518 1.00 36.08 C 
25.534 39.703 1.00 33.16 N 
23.561 38.689 1.00 34.81 C 
24.736 40.479 1.00 31.67 C 
23.542 39.917 1.00 33.57 N 

26.200 35.242 1.00 31.81 N 
27.185 34.164 1.00 28.56 C 

26.726 33.330 1.00 30.77 C 
26.893 32.131 1.00 31.49 O 
27.402 33.319 1.00 26.19 C 
28.320 34.059 1.00 23.78 C 

26.201 32.804 1.00 20.57 C 
25.874 33.943 1.00 30.02 N 
25.601 33.574 1.00 28.66 C 
24.539 32.506 1.00 32.41 C 
24.114 31.976 1.00 34.16 O 
26.941 33.272 1.00 25.15 C 
27.549 34.309 1.00 20.06 O 
24.001 32.151 1.00 34.20 N 
23.037 31.037 1.00 32.86 C 
23.868 29.827 1.00 30.98 C 
23.333 28.740 1.00 29.04 O 
25.166 29.993 1.00 30.75 N 
26.009 28.823 1.00 30.96 C 
26.406 28.357 1.00 30.00 C 
27.245 27.464 1.00 31.44 O 
27.274 28.939 1.00 29.63 C 
27.213 28.873 1.00 27.48 C 
25.898 28.275 1.00 26.05 C 
25.744 27.048 1.00 22.82 O 
25.217 29.285 1.00 23.09 N 
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25.870 28.937 1.00 28.08 N 

26.121 28.598 1.00 24.20 C 

24.865 29.013 1.00 20.59 C 

24.265 29.758 1.00 18.88 O 

27.389 29.323 1.00 23.71 C 

28.821 29.002 1.00 18.66 C 

29.933 30.119 1.00 20.14 S 

30.218 31.178 1.00 20.44 C 

24.291 28.590 1.00 18.31 N 

23.178 29.081 1.00 15.72 C 

23.578 30.219 1.00 20.60 C 

24.313 29.970 1.00 21.33 O 

22.377 28.144 1.00 3.10 C 

21.986 27.022 1.00 6.88 C 

21.594 25.660 1.00 6.40 C 

20.905 27.507 1.00 2.35 C 
23.089 31.444 1.00 22.28 N 
23.608 32.600 1.00 23.33 C 
22.582 32.858 1.00 25.45 C 
21.824 33.825 1.00 26.39 O 
23.755 33.811 1.00 20.42 C 
24.497 34.970 1.00 17.88 C 
23.855 35.863 1.00 17.45 C 
25.860 35.190 1.00 16.08 C 
24.474 36.899 1.00 17.55 C 
•26.483 36.191 1.00 15.00 C 
25.845 37.094 1.00 16.83 C 
26.458 38.170 1.00 14.24 O 
22.481 31.957 1.00 24.20 N 
21.666 32.210 1.00 22.82 C 
21.915 33.548 1.00 23.79 C 
20.954 34.278 1.00 24.63 O 
21.716 31.111 1.00 16.95 C 
21.025 29.874 1.00 12.19 C 
19.648 29.811 1.00 8.04 C 

21.906 28.834 1.00 10.42 C 

19.113 28.615 1.00 9.49 C 
21.280 27.668 1.00 10.61 C 
19.897 27.523 1.00 8.96 C 
23.127 33.867 1.00 23.58 N 
23.559 35.199 1.00 20.96 C 
24.991 35.465 1.00 21.45 C 
25.851 34.594 1.00 16.49 O 
23.231 35.177 1.00 17.74 C 
25.362 36.776 1.00 24.20 N 
26.820 37.077 1.00 26.98 C 

27.656 36.110 1.00 30.00 C 
28.612 35.474 1.00 33.83 O 
27.001 38.507 1.00 22.20 C 
25.681 38.849 1.00 23.60 C 
24.959 37.595 1.00 22.31 C 
27.379 35.897 1.00 30.71 N 
28.044 34.847 1.00 29.94 C 
27.567 33.436 1.00 28.66 - C 

28.114 32.637 1.00 30.84 O 
28.030 35.248 1.00 29.29 C 
26.639 35.154 1.00 28.78 C 
25.688 35.420 1.00 28.01 O 
26.618 34.808 1.00 28.47 O 

26.657 33.048 1.00 25.87 N 
26.365 31.574 1.00 25.10 C 
26.295 31.255 1.00 24.59 C 
25.158 31.500 1.00 23.38 O 
25.017 30.982 1.00 19.11 C 



7 CONT'D 



SUBSTITUTE SHEET (RULE 26) 



WO 01/66599 



PCT/IB01/00475 



88/ 



ATOM 


2818 


CG 


LEU 


B 


782 


14 . 


359 


ATOM 


2819 


CD1 


LEU 


B 


782 


15. 


244 


ATOM 


2820 


CD2 


LEU 


B 


782 


15. 


297 


ATOM 


2821 


N 


ILE 


B 


783 


11. 


211 


ATOM 


2822 


CA 


ILE 


B 


783 


9. 


800 


ATOM 


2823 


C 


ILE 


B 


783 


9. 


527 


ATOM 


2824 


0 


ILE 


B 


783 


9. 


878 


ATOM 


2825 


CB 


ILE 


B 


783 


9. 


054 


ATOM 


2826 


CGI 


ILE 


B 


783 


9. 


261 


ATOM 


2827 


CG2 


ILE 


B 


783 


7. 


552 


ATOM 


2828 


CD1 


ILE 


B 


783 


8. 


716 


ATOM 


2829 


N 


LEU 


B 


784 


8. 


944 


ATOM 


2830 


CA 


LEU 


B 


784 


8. 


701 


ATOM 


2831 


C 


LEU 


B 


784 


7. 


258 


ATOM 


2832 


o 


LEU 


B 


784 


6. 


416 


ATOM 


2833 


CB 


LEU 


B 


784 


8. 


964 


ATOM 


2834 


CG 


LEU 


B 


784 


10. 


372 


ATOM 


2835 


CD1 


LEU 


B 


784 


10. 


624 


ATOM 


2836 


CD 2 


LEU 


B 


784 


11 . 


501 


ATOM 


2837 


N 


ASN 


B 


785 


7. 


024 


ATOM 


2838 


CA 


ASN 


B 


785 


5. 


836 


ATOM 


2839 


C 


ASN 


B 


785 


5. 


889 


ATOM 


2840 


0 


ASN 


B 


785 


6. 


855 


ATOM 


2841 


CB 


ASN 


B 


785 


5. 


521 


ATOM 


2842 


CG 


ASN 


B 


785 


6. 


402 


ATOM 


2843 


OD1 


ASN 


B 


785 


7. 


103 


ATOM 


2844 


ND2 


ASN 


B 


785 


6. 


499 


ATOM 


2845 


N 


GLU 


B 


786 


4 . 


840 


ATOM 


2846 


CA 


GLU 


B 


786 


4 . 


642 


ATOM 


2847 


C 


GLU 


B 


786 


5. 


564 


ATOM 


2848 


o 


GLU 


B 


786 


6. 


028 


ATOM 


2849 


CB 


GLU 


B 


786 


3 . 


188 


ATOM 


2850 


CG 


GLU 


B 


786 


2 . 


891 


ATOM 


2851 


CD 


GLU 


B 


786 


1 . 


440 


ATOM 


2852 


OE1 


GLU 


B 


786 


0 . 


702 


ATOM 


2853 


OE2 


GLU 


B 


786 


1. 


210 


ATOM 


2854 


N 


GLN 


B 


787 


5. 


947 


ATOM 


2855 


CA 


GLN 


B 


787 


6. 


991 


ATOM 


2856 


C 


GLN 


B 


787 


8 . 


291 


ATOM 


2857 


O 


GLN 


B 


787 


8 . 


607 


ATOM 


2858 


CB 


GLN 


B 


787 


6. 


988 


ATOM 


2859 


CG 


GLN 


B 


787 


7 . 


191 


ATOM 


2860 


CD 


GLN 


B 


787 


6. 


217 


ATOM 


2861 


OE1 


GLN 


B 


7 87 


6. 


518 


ATOM 


2862 


NE2 


GLN 


B 


787 


5. 


049 


ATOM 


2863 


N 


ARG 


B 


788 


9. 


118 


ATOM 


2864 


CA 


ARG 


B 


788 


10 . 


179 


ATOM 


2865 


C 


ARG 


B 


788 


10 . 


193 


ATOM 


2866 


O 


ARG 


B 


788 


11 . 


187 


ATOM 


2867 


CB 


ARG 


B 


788 


9. 


878 


ATOM 


2868 


CG 


ARG 


B 


788 


11. 


271 


ATOM 


2869 


CD 


ARG 


B 


788 


11. 


419 


ATOM 


2870 


NE 


ARG 


B 


788 


12 . 


838 


ATOM 


2871 


CZ 


ARG 


B 


788 


13. 


389 


ATOM 


2872 


NH1 


ARG 


B 


788 


12. 


642 


ATOM 


2873 


NH2 


ARG 


B 


788 


14. 


703 


ATOM 


2874 


N 


MET 


B 


789 


9. 


109 


ATOM 


2875 


CA 


MET 


B 


789 


8. 


626 


ATOM 


2876 


C 


MET 


B 


789 


8. 


821 


ATOM 


2877 


O 


MET 


B 


789 


9. 


981 


ATOM 


2878 


CB 


MET 


B 


789 


7. 


245 


ATOM 


2879 


CG 


MET 


B 


789 


7. 


205 


ATOM 


2880 


SD 


MET 


B 


789 


5. 


530 


ATOM 


2881 


CE 


MET 


B 


789 


5. 


224 
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24.804 29.553 1.00 12.78 C 

26.011 29.139 1.00 12.09 C 

23.640 29.391 1.00 2.11 C 

27.339 30.926 1.00 25.58 N 
27.107 30.632 1.00 26.90 C 
27.220 29.156 1.00 28.98 C 
28.246 28.576 1.00 31.43 O 
28.122 31.494 1_00 24.13 C 
27.679 32.937 1.00 24.47 C 
28.076 31.145 1.00 23.74 C 
28.637 33.976 1.00 24.36 C 
26.341 28.398 1.00 31.45 N 
26.610 26.991 1.00 34.72 C 

27.022 26.696 1.00 37.53 C 
26.135 26.636 1.00 38.84 O 
25.456 26.013 1.00 31.39 C 
24.958 25.784 1.00 28.84 C 
24.815 24.290 1.00 30.01 C 
25.704 26.402 1.00 27.46 C 
28.229 26.235 1.00 42.32 N 
28.649 25.485 1.00 45.46 C 
28.351 23.995 1.00 45.58 C 
27.827 23.471 1.00 43.57 O 
30.133 25.819 1.00 44.95 C 
31.180 25.179 1.00 45.26 C 
30.927 24.177 1.00 45.16 O 
32.414 25.665 1.00 44.97 N 
28.103 23.212 1.00 48.48 N 
28.311 21.894 1.00 50.98 C 
28.958 20.897 1.00 52.47 C 
28.423 19.916 1.00 54.11 O 
28.544 21.490 1.00 50.08 C 
27.385 20.556 1.00 51.99 C 
27.065 20.318 1.00 52.48 C 
27.983 20.763 1.00 52.93 O 
25.983 19.718 1.00 51.51 O 
30.149 21.191 1.00 53.42 N 
30.918 20.554 1.00 53.75 C 
30.173 20.535 1.00 55.11 C 
29.483 19.579 1.00 56.62 O 
32.185 21.434 1.00 55.00 C 
33.420 20.617 1.00 55.51 C 
33.535 19.467 1.00 57.44 C 
34.313 18.538 1.00 57.35 O 
32.851 19.483 1.00 58.58 N 
30.299 21.561 1.00 54.42 N 
29.395 21.878 1.00 52.33 C 
27.977 21.310 1.00 50.11 C 
27.458 20.805 1.00 50.10 O 
29.358 23.402 1.00 52.03 C 
29.330 23.979 1.00 54.01 C 
28.540 25.261 1.00 53.12 C 
28.681 25.582 1.00 53.12 N 
28.920 26.752 1.00 52.12 C 
29.040 27.824 1.00 51.80 N 

29.023 26.810 1.00 51.92 N 
27.267 21.553 1.00 42.42 N 
26.160 20.792 1.00 40.43 C 
26.461 19.335 1.00 43.39 C 

26.340 18.933 1.00 41.46 O 
25.728 21.286 1.00 35.22 C 
24.860 22.594 1.00 27.06 C 
24.084 22.712 1.00 18.85 S 
24.325 24.433 1.00 25.22 C 



7 CONT'D 



SUBSTITUTE SHEET (RULE 26) 



WO 01/66599 



PCT/IB01/00475 



89/ 



ATOM 


2882 


N 


LYS 


B 


790 • 


7. 


901 


ATOM 


2883 


CA 


LYS 


B 


790 


7 . 


920 


ATOM 


2884 


C 


LYS 


B 


790 


9. 


250 


ATOM 


2885 


O 


LYS 


B 


790 


9. 


404 


ATOM 


2886 


CB 


LYS 


B 


7 90 


7. 


524 


ATOM 


2887 


CG 


LYS 


B 


790 


8. 


454 


ATOM 


2888 


CD 


LYS 


B 


790 


8. 


102 


ATOM 


2889 


CE 


LYS 


B 


790 


8. 


706 


ATOM 


2890 


NZ 


LYS 


B 


790 


9. 


583 


ATOM 


2897 


N 


GLU 


B 


791 


10. 


240 


ATOM 


2891 


CA 


GLU 


B 


791 


11. 


512 


ATOM 


2898 


C 


GLU 


B 


791 


12. 


562 


ATOM 


2899 


O 


GLU 


B 


791 


13. 


515 


ATOM 


2892 


CB 


GLU 


B 


791 


11. 


812 


ATOM 


2893 


CG 


GLU 


B 


791 


11. 


331 


ATOM 


2894 


CD 


GLU 


B 


791 


11. 


407 


ATOM 


2895 


OE1 


GLU 


B 


791 


12. 


014 


ATOM 


2896 


OE2 


GLU 


B 


791 


10. 


825 


ATOM 


2800 


N 


SER 


B 


792 


12. 


356 


ATOM 


2901 


CA 


SER 


B 


7 92 


13. 


161 


ATOM 


2904 


C 


SER 


B 


792 


13. 


885 


ATOM 


2905 


o 


SER 


B 


792 


14. 


807 


ATOM 


2902 


CB 


SER 


B 


792 


12. 


329 


ATOM 


2903 


OG 


SER 


B 


792 


11. 


217 


ATOM 


2906 


N 


SER 


B 


793 


13. 


349 


ATOM 


2907 


CA 


SER 


B 


793 


13. 


772 


ATOM 


2910 


C 


SER 


B 


793 


13 . 


055 


ATOM 


2911 


O 


SER 


B 


793 


13. 


338 


ATOM 


2908 


CB 


SER 


B 


793 


15. 


268 


ATOM 


2909 


OG 


SER 


B 


793 


15. 


486 


ATOM 


2912 


N 


PHE 


B 


794 


12 . 


082 


ATOM 


2913 


CA 


PHE 


B 


794 


11 . 


438 


ATOM 


2914 


C 


PHE 


B 


794 


10 . 


083 


ATOM 


2915 


O 


PHE 


B 


794 


9 . 


331 


ATOM 


2916 


CB 


PHE 


B 


794 


12 . 


483 


ATOM 


2917 


CG 


PHE 


B 


794 


12. 


813 


ATOM 


2918 


CD1 


PHE 


B 


794 


11 . 


952 


ATOM 


2919 


CD2 


PHE 


B 


794 


13 . 


994 


ATOM 


2920 


CE1 


PHE 


B 


794 


12. 


314 


ATOM 


2921 


CE2 


PHE 


B 


794 


14 . 


334 


ATOM 


2922 


cz 


PHE 


B 


794 


13. 


494 


ATOM 


2923 


N 


TYR 


B 


795 


9. 


511 


ATOM 


2924 


CA 


TYR 


B 


795 


8. 


097 


ATOM 


2925 


C 


TYR 


B 


795 


7 . 


203 


ATOM 


2926 


O 


TYR 


B 


795 


6. 


224 


ATOM 


2927 


CB 


TYR 


B 


795 


7 . 


687 


ATOM 


2928 


CG 


TYR 


B 


795 


6. 


239 


ATOM 


2929 


CD1 


TYR 


B 


795 


5. 


775 


ATOM 


2930 


CD2 


TYR 


B 


795 


5. 


293 


ATOM 


2931 


CE1 


TYR 


B 


795 


4 . 


443 


ATOM 


2932 


CE2 


TYR 


B 


795 


3. 


975 


ATOM 


2933 


CZ 


TYR 


B 


795 


3. 


555 


ATOM 


2934 


OH 


TYR 


B 


795 


2 . 


271 


ATOM 


2935 


N 


SER 


B 


796 


7 . 


525 


ATOM 


2936 


CA 


SER 


B 


796 


6. 


695 


ATOM 


2937 


C 


SER 


B 


796 


6. 


773 


ATOM 


2938 


o 


SER 


B 


796 


5. 


818 


ATOM 


2939 


CB 


SER 


B 


796 


7 . 


201 


ATOM 


2940 


OG 


SER 


B 


796 


6. 


260 


ATOM 


2941 


N 


LEU 


B 


797 


7. 


993 


ATOM 


2942 


CA 


LEU 


B 


797 


8. 


173 


ATOM 


2943 


C 


LEU 


B 


797 


7. 


456 


ATOM 


2944 


o 


LEU 


B 


797 


6. 


516 


ATOM 


2945 


CB 


LEU 


B 


797 


9. 


627 
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27.019 18.554 1.00 49.59 N 

27.356 17.152 1.00 53.66 C 

27.138 16.438 1.00 55.59 C 

26.449 15.451 1.00 54.99 O 

28.803 16.823 1.00 58.38 C 

30.000 16.839 1.00 62.98 C 

31.467 16.812 1.00 64.35 C 

32.470 17.750 1.00 64.80 C 

33.615 17.398 1.00 65.89 N 

27.883 16.928 1.00 59.01 N 

28.180 16.325 1.00 60.26 C 

27.372 17.075 1.00 59.57 C 

28.084 17.362 1.00 59.58 O 

29.690 16.396 1.00 61.44 C 

30.541 15.230 1.00 64.18 C 

32.023 14.964 1.00 63.81 C 

32.873 15.686 1.00 64.39 OE2 O 

32.479 13.921 1.00 61.58 OEl O 

26.093 17.336 1.00 58.96 N 
25.386 18.318 1.00 60.47 C 
24.146 17.800 1.00 61.28 C 
23.637 18.423 1.00 60.51 O 
24.856 19.496 1.00 60.59 C 
24.071 19.068 1.00 60.51 O 
23.537 16.740 1.00 62.28 , N 
22.390 15.980 1.00 59.38 C 

21.094 16.395 1.00 59.29 C 
19.956 15.956 1.00 59.26 O 
22.092 16.046 1.00 58.77 C 
21.177 17.131 1.00 58.35 O 
21.293 17.298 1.00 56.76 N 
20.101 17.848 1.00 53.98 C 
20.449 18.381 1.00 52.47 C 
19.776 19.089 1.00 52.94 O 
19.449 18.723 1.00 53.99 C 
20.036 20.049 1.00 52.79 C 
19.835 21.112 1.00 51.28 CD2 C 
20.776 20.193 1.00 52.69 CD1 C 
20.392 22.323 1.00 53.40 CE2 C 
21.333 21.405 1.00 51.61 CEl C 
21.124 22.484 1.00 52.92 C 
21.498 17.791 1.00 51.98 N 
21.854 17.948 1.00 50.02 C 
20.620 17.824 1.00 47.15 C 
20.519 18.550 1.00 44.52 O 
22.940 16.942 1.00 50.67 C 
23.360 17.090 1.00 51.96 C 
23.816 18.328 1.00 53.04 C 
23.257 16.068 1.00 50.61 C 
24.197 18.488 1.00 52.73 C 
23.606 16.219 1.00 48.73 C 
24.092 17.430 1.00 51.21 C 
24.50^^ 17.724 1.00 52.07 O 
19.702 16.928 1.00 45.93 N 
18.577 16.630 1.00 45.44 C 
17.526 17.727 1.00 46.09 C 
16.753 17.932 1.00 46.52 O 
18.008 15.310 1.00 44.49 C 
17.068 14.866 1.00 44.33 O 
17.547 18.339 1.00 42.89 N 
16.772 19.556 1.00 39.09 C 
17.493 20.695 1.00 37.45 C 
16.975 21.314 1.00 38.11 O 
16.544 19.976 1.00 35.66 C 



7 CONT'D 



SUBSTITUTE SHEET (RULE 26) 



WO 01/66599 



PCT/IB01/00475 



90/ 



ATOM 


2946 


CG 


LEU 


B 


797 


9. 


865 


ATOM 


2947 


CD1 


LEU 


B 


797 


9. 


97 6 


ATOM 


2948 


CD2 


LEU 


B 


797 


11. 


063 


ATOM 


2949 


N 


CYS 


B 


798 


7. 


846 


ATOM 


2950 


CA 


CYS 


B 


798 


7. 


108 


ATOM 


2951 


C 


CYS 


B 


798 


5. 


586 


ATOM 


2952 


O 


CYS 


B 


798 


5. 


028 


ATOM 


2953 


CB 


CYS 


B 


798 


7. 


309 


ATOM 


2954 


SG 


CYS 


B 


798 


9. 


037 


ATOM 


2955 


N 


LEU 


B 


799 


4 . 


924 


ATOM 


2956 


CA 


LEU 


B 


799 


3. 


486 


ATOM 


2957 


C 


LEU 


B 


799 


3. 


187 


ATOM 


2958 


O 


LEU 


B 


799 


2. 


265 


ATOM 


2959 


CB 


LEU 


B 


799 


2. 


901 


ATOM 


2960 


CG 


LEU 


B 


799 


2. 


726 


ATOM 


2961 


CD1 


LEU 


B 


799 


2. 


475 


ATOM 


2962 


CD2 


LEU 


B 


799 


1. 


621 


ATOM 


2963 


N 


THR 


B 


800 


3. 


925 


ATOM 


2964 


CA 


THR 


B 


800 


3. 


643 


ATOM 


2965 


C 


THR 


B 


800 


3. 


728 


ATOM 


2966 


o 


THR 


B 


800 


2. 


922 


ATOM 


2967 


CB 


THR 


B 


800 


4 . 


667 


ATOM 


2968 


OG1 


THR 


B 


800 


4 . 


563 


ATOM 


2969 


CG2 


THR 


B 


800 


4 . 


547 


ATOM 


2970 


N 


MET 


B 


801 


4. 


791 


ATOM 


2971 


CA 


MET 


B 


801 


5. 


035 


ATOM 


2972 


C 


MET 


B 


801 


3. 


903 


ATOM 


2973 


O 


MET 


B 


801 


3. 


239 


ATOM 


2974 


CB 


MET 


B 


801 


6. 


312 


ATOM 


2975 


CG 


MET 


B 


801 


7. 


317 


ATOM 


2976 


SD 


MET 


B 


801 


8. 


947 


ATOM 


2977 


CE 


MET 


B 


801 


9. 


502 


ATOM 


2978 


N 


TRP 


B 


802 


3. 


478 


ATOM 


2979 


CA 


TRP 


B 


802 


2. 


283 


ATOM 


2980 


C 


TRP 


B 


802 


0. 


982 


ATOM 


2981 


O 


TRP 


B 


802 


0. 


031 


ATOM 


2982 


CB 


TRP 


B 


802 


2. 


273 


ATOM 


2983 


CG 


TRP 


B 


802 


1. 


781 


ATOM 


2984 


CD1 


TRP 


B 


802 


1. 


565 


ATOM 


2985 


CD2 


TRP 


B 


802 


1 . 


464 


ATOM 


2986 


NE1 


TRP 


B 


802 


1. 


118 


ATOM 


2987 


CE2 


TRP 


B 


802 


1. 


059 


ATOM 


2988 


CE3 


TRP 


B 


802 


1. 


492 


ATOM 


2989 


CZ2 


TRP 


B 


802 


0. 


64 9 


ATOM 


2990 


CZ3 


TRP 


B 


802 


1. 


082 


ATOM 


2991 


CH2 


TRP 


B 


802 


0. 


660 


ATOM 


2992 


N 


GLN 
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4.145 46.089 1.00 25.56 O 

2.708 46.658 1.00 25.87 C 
3.779 47.807 1.00 21.42 C 
3.071 49.095 1.00 18.95 C 
1.825 48.988 1.00 17.90 O 
3.644 50.073 1.00 16.47 O 
3.039 44.219 1.00 26.19 N 
3.844 43.143 1.00 23.03 C 
3.813 43.129 1.00 24.94 C 
4.789 42.945 1.00 21.76 O 
3.267 41.775' 1.00 14.31 C 
4.053 40.602 1.00 10.03 C 
5.477 40.607 1.00 2.02 C 
3.311 39.324 1.00 7.57 C 

2.575 43.430 1.00 26.93 N 
2.473 43.855 1.00 26.99 C 
3.061 45.202 1.00 30.41 C 
3.579 45.316 1.00 32.41 O 
1.023 43.850 1.00 23.00 C 
0.481 42.428 1.00 19.67 C 
0.773 44.388 1.00 20.02 C 

-0.749 42.809 1.00 23.56 C 

3.079 46.210 1.00 32.54 N 

4.008 47.343 1.00 34.91 C 

5.515 47.028 1.00 37.36 C 

6.338 47.748 1.00 36.92 O 

3.480 48.454 1.00 33.24 C 

1.981 48.722 1.00 31.08 C 

1.285 49.424 1.00 29.06 C 

1.460 50.911 1.00 28.19 C 

0.812 51.612 1.00 25.68 N 

6.026 45.964 1.00 37.65 N 

7.424 45.655 1.00 36.73 C 

7.967 44.855 1.00 35.80 C 

9.029 45.095 1.00 34.95 O 

7.632 44.788 1.00 36.96 C 

7.173 43.875 1.00 36.18 N 

7.258 43.092 1.00 35.82 C 

7.249 44.094 1.00 37.26 C 

8.048 43.987 1.00 33.81 O 

6.105 42.119 1.00 34.46 C 

5.830 41.080 1.00 32.14 C 

6.397 41.505 1.00 31.81 C 

4.717 40.085 1.00 29.67 C 

6.355 45.083 1.00 39.83 N 

6.313 46.204 1.00 41.12 C 

7.576 47.000 1.00 43.00 C 

7.709 47.749 1.00 41.73 O 
8.623 46.837 1.00 45.67 N 
9.691 47.807 1.00 50.53 C 

10.737 47.463 1.00 56.36 C 

11.628 48.217 1.00 55.00 O 

10.039 47.575 1.00 47.55 C 

9.193 48.348 1.00 44.32 C 

9.589 47.917 1.00 43.80 C 

9.554 49.817 1.00 43.10 C 

10.617 46.200 1.00 62.89 N 

11.645 45.450 1.00 70.18 C 

10.965 44.640 1.00 74.03 C 

11.106 44.868 1.00 75.14 O 

12.423 44.459 1.00 7l762 C 

13.743 44.925 1.00 74.53 C 

14.398 44.067 1.00 76.86 C 

15.436 44.861 1.00 80.82 N 
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2.707 19.422 1.00 19.10 C 
1.971 18.269 1.00 20.56 C 
0.518 18.595 1.00 21.99 N 
7.542 20.645 1.00 27.56 N 
8.819 20.059 1.00 29.48 C 
9.630 20.711 1.00 29.11 C 
9.861 20.013 1.00 30.81 O 
9.576 20.107 1.00 30.54 C 
8.836 19.134 1.00 31.90 C 
9.568 18.895 1.00 33.96 C 

10.787 19.139 1.00 32.26 O 

8.659 18.402 1.00 32.97 N 

10.068 21.960 1.00 25.94 N 

10.580 22.601 1.00 21.19 C 

9.587 22.309 1.00 20.61 C 

10.070 21.924 1.00 19.81 O 

11.020 24.055 1.00 13.98 C 

11.534 24.371 1.00 12.02 C 

11.711 25.850 1.00 10.99 C 

12.769 23.644 1.00 2.02 C 

8.255 22.373 1.00 18.68 N 

7.391 22.421 1.00 17.95 C 

7.402 21.259 1.00 16.56 C 

7.523 21.143 1.00 2.76 O 

5.996 22.683 1.00 17.27 C 

5.611 24.165 1.00 17.40 C 

4.346 24.617 1.00 14.19 N 

6.357 25.296 1.00 12.75 C 

4.329 25.904 1.00 9.27 C 

5.446 26.253 1.00 7.29 N 

7.000 20.168 1.00 17.67 N 

7.273 18.838 1.00 18.57 C 

8.668 18.620 1.00 16.85 C 

8.907 18.160 1.00 16.59 O 

6.983 17.995 1.00 17.20 C 

7.007 16.497 1.00 15.05 C 

6.830 16.080 1.00 11.53 C 

5.763 16.139 1.00 20.26 C 
9.753 18.995 1.00 15.98 N 

11.144 18.988 1.00 15.80 C 

11.279 19.679 1.00 15.92 C 

11.984 19.129 1.00 13.21 O 

12.167 19.520 1.00 14.10 C 

13.612 19.309 1.00 16.01 C 



7 CONT'D 



SUBSTITUTE SHEET (RULE 26) 



WO 01/66599 



PCT/IB01/00475 



102/ 



ATOM 


3714 


CD1 


TYR 


B 


890 


13 


.442 


ATOM 


3715 


CD2 


TYR 


B 


890 


14 


.322 


ATOM 


3716 


CE1 


TYR 


B 


890 


13 


.870 


ATOM 


3717 


CE2 


TYR 


B 


890 


14 


.724 


ATOM 


3718 


CZ 


TYR 


B 


890 


14 


.476 


ATOM 


3719 


OH 


TYR 


B 


890 


14 


.884 


ATOM 


3720 


N 


CYS 


B 


891 


15 


.709 


ATOM 


3721 


CA 


CYS 


B 


8 91 


16 


.793 


ATOM 


3722 


C 


CYS 


B 


891 


17 


.952 


ATOM 


3723 


O 


CYS 


B 


891 


19 


.048 


ATOM 


3724 


CB 


CYS 


B 


891 


16 


.289 


ATOM 


3725 


SG 


CYS 


B 


891 


17 


.726 


ATOM 


3726 


N 


LEU 


B 


892 


17 


.703 


ATOM 


3727 


CA 


LEU 


B 


892 


18 


.894 


ATOM 


3728 


C 


LEU 


B 


892 


19 


,425 


ATOM 


3729 


O 


LEU 


B 


892 


20 


.691 


ATOM 


3730 


CB 


LEU 


B 


892 


18 


.904 


ATOM 


3731 


CG 


LEU 


B 


892 


19 


.226 


ATOM 


3732 


CD1 


LEU 


B 


892 


20 


.531 


ATOM 


3733 


CD2 


LEU 


B 


892 


18 


.229 


ATOM 


3734 


N 


ASN 


B 


893 


18 


.527 


ATOM 


3735 


CA 


ASN 


B 


893 


19 


.020 


ATOM 


3736 


C 


ASN 


B 


893 


19 


.918 


ATOM 


3737 


O 


ASN 


B 


893 


20 


.949 


ATOM 


3738 


CB 


ASN 


B 


893 


17 


.830 


ATOM 


3739 


CG 


ASN 


B 


893 


17 


.606 


ATOM 


3740 


OD1 


ASN 


B 


893 


16 


.510 


ATOM 


3741 


ND2 


ASN 


B 


893 


18 


.742 


ATOM 


3742 


N 


THR 


B 


894 


19 


.611 


ATOM 


3743 


CA 


THR 


B 


894 


20 


.313 


ATOM 


3744 


C 


THR 


B 


894 


21 


.563 


ATOM 


3745 


o 


THR 


B 


894 


22 


.615 


ATOM 


3746 


CB 


THR 


B 


894 


19 


.343 


ATOM 


3747 


OG1 


THR 


B 


894 


18 


.105 


ATOM 


3748 


CG2 


THR 


B 


894 


19 


.894 


ATOM 


3749 


ISI 


PHE 


B 


895 


21 


.508 


ATOM 


3750 


CA 


PHE 


B 


895 


22 


.779 


ATOM 


3751 


C 


PHE 


B 


895 


23 


.770 


ATOM 


3752 


0 


PHE 


B 


895 


24 


.748 


ATOM 


3753 


CB 


PHE 


B 


895 


22 


.557 


ATOM 


3754 


CG 


PHE 


B 


895 


23 


.599 


ATOM 


3755 


CD1 


PHE 


B 


895 


24 


.119 


ATOM 


3756 


CD2 


PHE 


B 


895 


24 


.111 


ATOM 


3757 


CE1 


PHE 


B 


895 


25 


.090 


ATOM 


3758 


CE2 


PHE 


B 


895 


25 


.155 


ATOM 


3759 


CZ 


PHE 


B 


895 


25 


.636 


ATOM 


3760 


N 


ILE 


B 


896 


23 


.568 


ATOM 


37 61 


CA 


ILE 


B 


896 


24 


.326 


ATOM 


3762 


C 


ILE 


B 


896 


24 


.699 


ATOM 


3763 


O 


ILE 


B 


896 


25 


.826 


ATOM 


3764 


CB 


ILE 


B 


896 


23 


.536 


ATOM 


3765 


CGI 


ILE 


B 


896 


22 


.961 


ATOM 


3766 


CG2 


ILE 


B 


896 


24 


.431 


ATOM 


37 67 


CD1 


ILE 


B 


896 


21 


.851 


ATOM 


3768 


N 


GLN 


B 


897 


23 


.856 


ATOM 


3769 


CA 


GLN 


B 


8 97 


24 


.405 


ATOM 


3770 


C 


GLN 


B 


897 


24 


.907 


ATOM 


3771 


O 


GLN 


B 


897 


25 


.129 


ATOM 


3772 


CB 


GLN 


B 


8 97 


23 


.410 


ATOM 


3773 


CG 


GLN 


B 


897 


23 


.437 


ATOM 


3774 


CD 


GLN 


B 


897 


21 


.930 


ATOM 


3775 


OE1 


GLN 


B 


897 


21 


.659 


ATOM 


3776 


NE2 


GLN 


B 


897 


21 


.303 


ATOM 


3777 


N 


SER 


B 


898 


25 


.099 



FIG 
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14.213 18.091 1.00 13.30 CD2 C 

14.406 20.283 1.00 16.52 GDI C 

15.448 17.762 1.00 11.78 CE2 C 

15.681 19.943 1.00 18.15 CE1 C 

16.234 18.679 1.00 17.51 C 

17.535 18.332 1.00 19.47 O 

10.594 20.821 1.00 16.14 N 

10.649 21.695 1.00 15.40 C 

9.869 21.094 1.00 17.49 C 

10.572 20.952 1.00 14.73 O 

10.299 23.108 1.00 11.12 C 

10.264 24.275 1.00 4.51 S 

8.611 20.648 1.00 15.56 N 

7.943 20.046 1.00 19.16 C 

8.673 18.791 1.00 20.63 C 

8.866 18.634 1.00 16.74 O 

6.454 19.972 1.00 17.28 C 

5.263 19.089 1.00 17.15 C 

4.535 19.209 1.00 15.20 C 

4.111 19.144 1.00 11.84 C 

9.225 17.965 1.00 18.95 N 

9.980 16.819 1.00 19.32 C 

11.136 17.172 1.00 20.72 C 

11.450 16.555 1.00 20.23 O 

10.413 15.909 1.00 15.23 C 
9.343 14.786 1.00 14.40 C 
9.000 14.326 1.00 12.57 O 
8.728 14.372 1.00 2.74 N 

11.838 18.288 1.00 20.30 N 

13.045 18.639 1.00 17.91 C 

12.601 19.343 1.00 20.83 C 

13.229 19.290 1.00 19.07 O 

13.820 19.502 1.00 16.77 C 

14.068 18.853 1.00 13.25 O 

15.150 19.967 1.00 16.02 C 
11.442 20.011 1.00 24.80 N 
10.891 20.601 1.00 25.35 C 
10.547 19.519 1.00 24.13 C 

11.151 19.245 1.00 21.42 O 
9.709 21.552 1.00 24.79 C 
9.119 22.443 1.00 23.13 C 
9.826 23.556 1.00 21.85 C 
7.841 22.195 1.00 21.00 C 
9.191 24.329 1.00 20.63 C 
7.318 23.000 1.00 17.97 C 
7.955 24.079 1.00 14.91 C 
9.505 18.741 1.00 25.81 N 
9.265 17.542 1.00 28.03 C 

10.466 16.714 1.00 29.22 C 

10.414 16.245 1.00 25.47 O 
8.286 16.667 1.00 29.09 C 
7.116 17.494 1.00 28.07 C 
7.801 15.524 1.00 29.16 C 
6.432 16.736 1.00 26.31 C 

11.458 16.402 1.00 31.53 N 

12.610 15.669 1.00 34.11 C 

13.785 16.479 1.00 35.24 C 

14.885 15.989 1.00 34.94 O 

13.170 14.672 1.00 33.84 C 

12.614 13.259 1.00 33.85 C 

12.417 13.012 1.00 35.17 C 

11.216 12.864 1.00 35.65 O 

13.588 13.032 1.00 34.05 N 

13.656 17.781 1.00 35.52 N 



7 CONT'D 



SUBSTITUTE SHEET (RULE 26) 



WO 01/66599 



PCT/IB01/00475 



103/ 



ATOM 


' 3778 


CA 


SER 


B 


898 


25. 


413 


ATOM 


3779 


C 


SER 


B 


898 


26. 


624 


ATOM 


3780 


O 


SER 


B 


898 


26. 


592 


ATOM 


3781 


CB 


SER 


B 


898 


25. 


610 


ATOM 


3782 


OG 


SER 


B 


898 


26. 


960 


ATOM 


3783 


N 


ARG 


B 


899 


27. 


653 


ATOM 


3784 


CA 


ARG 


B 


899 


28. 


832 


ATOM 


3785 


C 


ARG 


B 


899 


28. 


697 


ATOM 


3786 


O 


ARG 


B 


899 


29. 


448 


ATOM 


3787 


CB 


ARG 


B 


899 


30. 


010 


ATOM 


3788 


CG 


ARG 


B 


899 


30. 


935 


ATOM 


3789 


CD 


ARG 


B 


899 


31. 


975 


ATOM 


3790 


NE 


ARG 


B 


899 


32. 


859 


ATOM 


3791 


CE 


ARG 


B 


899 


33. 


457 


ATOM 


3792 


NH1 


ARG 


B 


899 


33. 


325 


ATOM 


3793 


NH2 


ARG 


B 


899 


34. 


249 


ATOM 


3794 


N 


ALA 


B 


900 


27. 


763 


ATOM 


3795 


CA 


ALA 


B 


900 


27. 


400 


ATOM 


3796 


C 


ALA 


B 


900 


26. 


4 95 


ATOM 


3797 


O 


ALA 


B 


900 


26. 


814 


ATOM 


3798 


CB 


ALA 


B 


900 


26. 


870 


ATOM 


3799 


N 


LEU 


B 


901 


25. 


529 


ATOM 


3800 


CA 


LEU 


B 


901 


24. 


544 


ATOM 


3801 


C 


LEU 


B 


901 


24. 


962 


ATOM 


3802 


o 


LEU 


B 


901 


24. 


163 


ATOM 


3803 


CB 


LEU 


B 


901 


23. 


158 


ATOM 


3804 


CG 


LEU 


B 


901 


22. 


974 


ATOM 


3805 


CD1 


LEU 


B 


901 


22. 


024 


ATOM 


3806 


CD2 


LEU 


B 


901 


22. 


779 


ATOM 


3807 


N 


SER 


B 


902 


26. 


143 


ATOM 


3808 


CA 


SER 


B 


902 


26. 


665 


ATOM 


3809 


C 


SER 


B 


902 


25. 


919 


ATOM 


3810 


O 


SER 


B 


902 


25. 


928 


ATOM 


3811 


CB 


SER 


B 


902 


26. 


660 


ATOM 


3812 


OG 


SER 


B 


902 


27. 


826 


ATOM 


3813 


N 


VAL 


B 


903 


25. 


282 


ATOM 


3814 


CA 


VAL 


B 


903 


24- 


624 


ATOM 


3815 


C 


VAL 


B 


903 


25. 


514 


ATOM 


3816 


o 


VAL 


B 


903 


26. 


116 


ATOM 


3817 


CB 


VAL 


B 


903 


23. 


206 


ATOM 


3818 


CGI 


VAL 


B 


903 


22. 


579 


ATOM 


3819 


CG2 


VAL 


B 


903 


22. 


210 


ATOM 


3820 


N 


GLU 


B 


904 


25. 


690 


ATOM 


3821 


CA 


GLU 


B 


904 


26. 


288 


ATOM 


3822 


C 


GLU 


B 


904 


25. 


312 


ATOM 


3823 


o 


GLU 


B 


904 


24. 


482 


ATOM 


3824 


CB 


GLU 


B 


904 


27. 


074 


ATOM 


3825 


CG 


GLU 


B 


904 


27. 


616 


ATOM 


3826 


CD 


GLU 


B 


904 


29. 


083 


ATOM 


3827 


OE1 


GLU 


B 


904 


29. 


830 


ATOM 


3828 


OE2 


GLU 


B 


904 


29. 


572 


ATOM 


3829 


N 


PHE 


B 


905 


25. 


478 


ATOM 


3830 


CA 


PHE 


B 


905 


24. 


920 


ATOM 


3831 


C 


PHE 


B 


905 


25. 


834 


ATOM 


3832 


O 


PHE 


B 


905 


27. 


000 


ATOM 


3833 


CB 


PHE 


B 


905 


24. 


686 


ATOM 


3834 


CG 


PHE 


B 


905 


23. 


458 


ATOM 


3835 


CD1 


PHE 


B 


905 


23. 


579 


ATOM 


3836 


CD2 


PHE 


B 


905 


22. 


231 


ATOM 


3837 


CE1 


PHE 


B 


905 


22. 


474 


ATOM 


3838 


CE2 


PHE 


B 


905 


21. 


160 


ATOM 


3839 


CZ 


PHE 


B 


905 


21. 


261 


ATOM 


3840 


N 


PRO 


B 


906 


25. 


403 


ATOM 


3841 


CA 


PRO 


B 


906 


26. 


150 



FIG. 
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14.707 18.661 1.00 33.62 C 

15.509 18.289 1.00 33.45 C 

16.645 18.735 1.00 34.54 O 

14.034 20.025 1.00 30.68 C 

13.689 19.991 1.00 27.78 O 

15.104 17.667 1.00 35.55 N 

15.874 17.364 1.00 40.38 C 

16.707 16.062 1.00 41.20 C 
17.609 15.721 1.00 39.68 O 
14.963 17.183 1.00 41.53 C 

14.437 18.207 1.00 46.23 C 
13.446 17.697 1.00 48.94 C 
12.751 18.597 1.00 50.88 N 
13.053 19.734 1.00 53.48 C 
14.235 20.386 1.00 54.28 N 
12.089 20.232 1.00 52.21 N 
16.289 15.242 1.00 40.51 N 
17.066 14.094 1.00 40.10 C 
18.109 14.713 1.00 40.90 C 
19.301 14.542 1.00 41.97 O 
16.263 12.943 1.00 38.11 C 
17.737 15.535 1.00 39.70 N 
18.698 16.019 1.00 36.26 C 
19.569 17.170 1.00 36.70 C 
20.472 17.547 1.00 38.56 O 
18.120 16.394 1.00 31.70 C 
16.821 15.577 1.00 27.80 C 
15.857 16.264 1.00 23.82 C 
17.383 14.177 1.00 25.92 C 
19-394 17.711 1.00 37.08 N 
20.193 18.823 1.00 36.73 C 
20.013 20.156 1.00 36.59 C 
20.979 20.933 1.00 35.55 O 
21.714 18.596 1.00 34.96 C 
22.087 17.931 1.00 33.22 O 
18.849 20.320 1.00 35.85 N 

18.708 21.599 1.00 36.50 C 
17.844 22.517 1.00 34.32 C 
16.801 22.207 1.00 31.53 O 
18.200 21.488 1.00 38.53 C 
18.173 22.890 1.00 39.84 C 
18.967 20.633 1.00 38.60 C 
18.382 23.721 1.00 30.70 N 
17.595 24.800 1.00 32.36 C 
16.718 25.599 1.00 29.17 C 
17.354 26.244 1.00 26.03 O 
18.395 25.865 1.00 34.89 C 
17.503 26.953 1.00 40.06 C 
17.081 27.044 1.00 44.95 C 
17.256 26.003 1.00 45.53 O 
16.554 28.149 1.00 43.13 O 
15.386 25.516 1.00 25.00 N 

14.438 26.485 1.00 21.52 C 
14.215 27.677 1.00 20.95 C 
13.888 27.426 1.00 23.36 O 
12.961 25.989 1.00 15.44 C 
12.984 25.080 1.00 12.51 C 
13.601 23.813 1.00 6.60 C 
12.515 25.543 1.00 8.50 C 
13.689 23.002 1.00 2.13 C 
12.707 24.686 1.00 7.63 C 
13.332 23.453 1.00 5.34 C 
14.434 28.916 1.00 20.49 N 
14.071 30.092" 1.00 18.71 C 



7 CONT'D 



SUBSTITUTE SHEET (RULE 26) 



WO 01/66599 



PCT/IB01/00475 



104/ 



ATOM 


3842 


C 


PRO 


B 


906 


26. 


343 


ATOM 


3843 


O 


PRO 


B 


906 


25. 


292 


ATOM 


3844 


CB 


PRO 


B 


906 


25. 


380 


ATOM 


3845 


CG 


PRO 


B 


906 


24. 


279 


ATOM 


3846 


CD 


PRO 


B 


906 


24. 


136 


ATOM 


3847 


N 


GLU 


B 


907 


27. 


511 


ATOM 


3848 


CA 


GLU 


B 


907 


27. 


704 


ATOM 


3849 


C 


GLU 


B 


907 


26. 


590 


ATOM 


3850 


O 


GLU 


B 


907 


26. 


525 


ATOM 


. 3851 


CB 


GLU 


B 


907 


28. 


658 


ATOM 


3852 


CG 


GLU 


B 


907 


30. 


150 


ATOM 


3853 


CD 


GLU 


B 


907 


31. 


021 


ATOM 


3854 


OE1 


GLU 


B 


907 


31. 


153 


ATOM 


3855 


OE2 


GLU 


B 


907 


31. 


534 


ATOM 


3856 


N 


MET 


B 


908 


25. 


741 


ATOM 


3857 


CA 


MET 


B 


908 


24 . 


898 


ATOM 


3858 


C 


MET 


B 


908 


23. 


875 


ATOM 


3859 


O 


MET 


B 


908 


23. 


4 65 


ATOM 


3860 


CB 


MET 


B 


908 


24. 


123 


ATOM 


3861 


CG 


MET 


B 


908 


24. 


7 64 


ATOM 


3862 


SD 


MET 


B 


908 


24. 


105 


ATOM 


3863 


CE 


MET 


B 


908 


23. 


828 


ATOM 


3864 


N 


MET 


B 


909 


23. 


240 


ATOM 


3865 


CA 


MET 


B 


909 


22. 


311 


ATOM 


3866 


C 


MET 


B 


909 


22. 


931 


ATOM 


3867 


O 


MET 


B 


909 


22. 


238 


ATOM 


3868 


CB 


MET 


B 


909 


21. 


999 


ATOM 


3869 


CG 


MET 


B 


909 


20. 


635 


ATOM 


3870 


SD 


MET 


B 


909 


20. 


003 


ATOM 


3871 


CE 


MET 


B 


909 


19. 


178 


ATOM 


3872 


N 


SER 


B 


910 


24. 


139 


ATOM 


3873 


CA 


SER 


B 


910 


24. 


818 


ATOM 


3874 


C 


SER 


B 


910 


24. 


819 


ATOM 


3875 


o 


SER 


B 


910 


24. 


685 


ATOM 


3876 


CB 


SER 


B 


910 


26. 


215 


ATOM 


3877 


OG 


SER 


B 


910 


26. 


607 


ATOM 


3878 


N 


GLU 


B 


911 


24. 


885 


ATOM 


3879 


CA 


GLU 


B 


911 


25. 


036 


ATOM 


3880 


C 


GLU 


B 


911 


23. 


754 


ATOM 


3881 


o 


GLU 


B 


911 


23. 


799 


ATOM 


3882 


CB 


GLU 


B 


911 


25. 


433 


ATOM 


3883 


CG 


GLU 


B 


911 


24. 


910 


ATOM 


3884 


CD 


GLU 


B 


911 


25. 


738 


ATOM 


3885 


OE1 


GLU 


B 


911 


26. 


386 


ATOM 


3886 


OE2 


GLU 


B 


911 


26. 


025 


ATOM 


3887 


N 


VAL 


B 


912 


22. 


690 


ATOM 


3888 


CA 


VAL 


B 


912 


21. 


398 


ATOM 


3889 


C 


VAL 


B 


S12 


21. 


064 


ATOM 


3890 


o 


VAL 


B 


912 


20. 


307 


ATOM 


3891 


CB 


VAL 


B 


912 


20. 


263 


ATOM 


3892 


CGI 


VAL 


B 


912 


20. 


679 


ATOM 


3893 


CG2 


VAL 


B 


912 


19. 


778 


ATOM 


3894 


N 


ILE 


B 


913 


21. 


300 


ATOM 


3895 


CA 


ILE 


B 


913 
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12.592 30.348 1.00 15.77 C 

11.974 30.361 1.00 9.47 O 

14.685 31.289 1.00 17.26 C 

15.433 30.757 1.00 18.79 C 

15.070 29.282 1.00 20.13 C 

12.105 30.712 1.00 17.16 N 

•10.803 31.380 1.00 18.98 C 

9.876 31.862 1.00 17.58 C 

8.743 31.404 1.00 14.04 O 

10.838 32.596 1.00 17.80 C 

11.036 32.357 1.00 19.85 C 

10.130 33.288 1.00 22.12 C 

8.886 33.145 1.00 13.68 OE2 O 

10.917 34.217 1.00 23.70 OE1 O 

10.236 32.804 1.00 18.45 N 

9.191 33.366 1.00 20.31 C 

8.830 32.313 1.00 22.17 C 

7.721 32.148 1.00 21.86 O 

9.520 34.632 1.00 17.73 C 

8.769 35.763 1.00 21.03 C 

9.373 37.312 1.00 24.57 S 
11.151 36.984 1.00 24.35 C 

9.882 31.808 1.00 26.09 N 

9.888 30.701 1.00 24.74 C 

9.202 29.491 1.00 22.49 C 

8.336 29.103 1.00 17.32 O 

11.352 30.455 1.00 25.60 C 

11.879 30.907 1.00 24.09 C 

12.624 29.404 1.00 19.42 S 

11.161 28.851 1.00 22.30 C 

9.374 29.041 1.00 22.97 N 
8.721 27.992 1.00 24.79 C 
7.218 28.173 1.00 26.58 C 
6.522 27.174 1.00 25.19 O 
9.246 27.924 1.00 22.80 C 
9.966 26.797 1.00 23.47 O 
6.764 29-400 1.00 27.63 N 
5.414 29-892 1.00 27.87 C 
4.612 29.716 1.00 24.73 C 
3.591 29.053 1.00 20.20 O 
5.239 31.369 1.00 30.73 C 
4.323 32.429 1.00 36.24 C 
3.916 33.652 1.00 43.49 ' C 
4.786 34.351 1.00 46.17 OE2 O 
2.758 34.173 1.00 45.61 OEl O 
5.068 30.324 1.00 23.14 N 
4.495 30.063 1.00 23.46 C 
4.435 28.586 1.00 23.40 C 
3.587 28.164 1.00 25.19 O 
5.230 30.775 1.00 21.45 C 
5.596 32.173 1.00 21.74 C 
6.516 30.084 1.00 22.10 C 
5.386 27.732 1.00 21.49 N 
5.468 26.345 1.00 20.13 C 
4.367 25.571 1.00 23.00 C 
3.666 24.729 1.00 22.70 O 
6.928 25.989 1.00 16.46 C 
7.629 26.412 1.00 14.29 C 
7.284 24.561 1.00 14.03 C 
9.102 26.249 1.00 13.21 C 
4.143 25.750 1.00 25.79 N 
3.200 25.004 1.00 27.23 C 
1.846 25.509 1.00 29.96 C 
1.033 24.611 1.00 32.65 O 
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3.144 25.248 1.00 24.52 C 

1.745 26.810 1.00 30.74 N 

0.551 27.483 1.00 32.59 C 

0.046 27.056 1.00 33.88 C 

-1.158 27.210 1.00 36.83 O 

0.649 29.037 1.00 31.42 C 

0.807 26.909 1.00 32.85 N 

0.097 26.762 1.00 31.20 C 

0.629 25.623 1.00 28.91 C 

-0.093 25.381 1.00 26.15 O 

-0.106 28.045 1.00 31.22 C 

-0.442 29.334 1.00 30.69 C 

-1.810 29.740 1.00 30.43 C 

-2.110 30.314 1.00 29.99 O 

-2.560 29.356 1.00 30.68 N 

1.635 24.862 1.00 24.75 N 

2.314 24.128 1.00 22.71 C 

1.722 22.751 1.00 24.08 C 

1.500 22.135 1.00 20.08 O 

3.745 24.313 1.00 19.74 C 

4.652 24.176 1.00 18.83 C 

4.226 25.208 1.00 18.41 C 

6.077 24.097 1.00 17.63 C 

1.302 22.131 1.00 26.23 N 

0.485 20.947 1.00 29.27 C 

-0.746 21.285 1.00 33.99 C 

-0.821 20.737 1.00 37.38 O 

0.279 20.455 1.00 26.37 C 

0.822 21.493 1.00 26.80 C 

1.461 22.599 1.00 25.13 C 

-1.706 22.106 1.00 36.86 N 

-2.804 22.628 1.00 36.57 C 

-2.421 23.097 1.00 33.26 C 

-3.170 22.670 1.00 33.52 O 

-3.519 23.731 1.00 39.69 C 

-3.855 25.088 l.OO 42.02 C 

-5.139 25.502 1.00 42.82 C 

-5.957 26.438 1.00 43.59 C 

-7.148 27.024 1.00 40.83 N 

-1.387 23.826 1.00 28.12 N 

-0.949 23.990 1.00 23.97 C 

-0.502 22.651 1.00 22.28 C 

-0.940 22.416 1.00 15.48 O 

0.102 25.100 1.00 21.67 C 

-0.238 26.447 1.00 20.06 C 

0.524 25.379 1.00 21.91 C 

0.916 27.283 1.00 16.22 C 

0.250 21.780 1.00 26.45 N 

0.672 20.539 1.00 32.29 C 

-0.527 19.591 1.00 32.33 C 

-0.596 18.990 1.00 32.64 O 

1.859 19.718 1.00 32.77 C 

3.299 20.264 1.00 34.41 C 

4.207 19.784 1.00 30.85 C 

4.035 19.948 1.00 33.18 C 

-1.550 19.508 1.00 30.57 N 

-2.757 18.792 1.00 29.80 C 

-3.585 19.325 1.00 29.13 C 

-4.689 18.819 1.00 24.51 O 

-3.639 18.830 1.00 31.19 C 

-3.139 20.266 1.00 31.21 N 

-3.908 20.980 1.00 33.85 C 

-5.074 21.870 1.00 34.21 C 

-6.006 22.246 1.00 32.50 O 
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